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1. Introduction 

1.1. Three surprises of high dimensions. This paper develops asymptotic meth- 
ods to count faces of random high-dimensional polytopes; a seemingly dry and un- 
promising pursuit. Yet our conclusions have surprising implications - in statistics, 
probability, information theory, and signal processing - with potential impacts in 
practical subjects like medical imaging and digital communications. Before involv- 
ing the reader in our lengthy analysis of high-dimensional face counting, we describe 
three implications of our results. 

1.1.1. Convex Hulls of Gaussian Point Clouds. Consider a random point cloud of 
n points Xi, i = l,...,rt, sampled independently and identically from a Gauss- 
ian distribution in R'^ with nonsingular covariance. This is a standard model of 
multivariate data; its properties are increasingly important in a wide range of ap- 
plications. At the same time, it is an attractive and in some sense timeless object 
for theoretical study. 

Properties of the convex hull of the random point cloud X ~ {xi} have at- 
tracted interest for several decades, increasingly so in recent years; there is a now- 
voluminous literature on the subject. The results could be significant for under- 
standing outlier detection, or classification problems in machine learning. 

A classical asymptotic result, holds that if the dimension d stays fixed, while 
the number of points n — > oo, the convex hull has ~ log*-''~^-'^^(n) vertices, and 
the remaining points of X are all of course in the interior of the convex hull. 

The modern trend in statistics and probability is to consider the case where both 
the number of dimensions d and the sample size n are large '19''21'. In that case, the 
intuition fostered by the classical fixed-dimension asymptotic is wildly inaccurate. 
Rather than the relatively few extreme points that we saw in the fixed-dimension 
asymptotic, there are now many extreme points, many edges, etc. - in fact, the 
maximal number conceivable. More precisely, let = k^iX) denote the largest 
number k such that 

• Each point Xj IS Bi vertex of conv(A'); 

• Each line segment [xj, Xi], j ^ i is an edge of conv(A^); 

• ... 

• Every fc + 1 distinct points of X span a fc-face of conv(A'). 
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Then, not only is defined and positive, it is rather large. Section fT.ll below gives 
a corollary of our main results saying roughly that, for e > 0, with overwhelming 
probability for large d, 

(1-1) fcd> , 1 

2elog(n/rf) 

Not only are no points of X 'inside' conv(A'), it is also true that no edge between 
any pair of points crosses the interior of conv(A'), etc. This is about as far from 
low-dimensional intuition as it is possible to get! 

1.1.2. Signal Recovery from Random Projections. Suppose we are interested in a 
vector xq G which, although unknown to us, is known to be fc-sparse - i.e. we 
know that it has at most k nonzeros when represented in the standard basis. We 
are allowed to ask some number n of 'questions' about xq, each question coming in 
the form of a projection y.i = (a^, xq) on a vector a.i S R^. How big should n be so 
that we may recover xq, i.e.: "How many questions suffice to recover a fc— sparse 
vector" ? 

Obviously, N suffice (simply ask for the N coordinates in the standard unit 
vector basis), but in cases where xq is very sparse, fc <C -/V, many fewer questions 
will do. Indeed, n = 2fc + 1 suffice; simply take the a.; as independent random 
vectors with iid Gaussian entries. (The matrix A having Oi for rows will then have 
its columns in general position, which implies that there cannot be two fc-sparse 
vectors xq and xi both answering the questions in the same way Although such 
a random set of questions determines xq uniquely, the task of actually recovering 
Xq from such information is daunting; in general, one must enumerate the fc-subsets 
of columns of A looking for a subset which can be combined linearly to generate y. 

A more useful question: how many questions are needed in order to permit 
computationally tractable recovery of Xq7 We will give precise and simply-stated 
results for reconstruction using standard linear programming. 

Generate n questions 'at random' by simply taking for A an n by matrix with 
iid Gaussian A'^(0, 1/n) entries. Obtain a vector of n measurements y — Axq where 
a;o has fc nonzeros. Consider the convex optimization problem 

(Pi) min ||a;|| 1 subject to = Aa;. 

If n is large enough relative to fc, then the solution xi to (Pi) is very likely to be 
exactly xq. Section [731 below gives a corollary of this paper's main results showing 
that, for N much larger than fc, and both large, this exact equality happens as soon 
as 

(1.2) n>2fc-log(Ar/n)(l+Op(l)). 

Thus if we sample not 2k + 1 projections but instead roughly 2fc log(A^/r7,) we 
can efficiently reconstruct the fc-sparse vector; and this can be far fewer than the 
A^ samples superficially required. 

1.1.3. How many gross errors can we efficiently correct? Consider a stylized prob- 
lem of transmitting m 'pieces' of information - i.e. m numbers - with immunity 
to occasional transmission errors. A standard strategy is encode the data to be 
transmitted as a block of N > m numbers, and to decode the received block. Let 
P be an TO X A^ matrix. Given a vector u e R™ to be transmitted, encode it as 
V = B'^u e R^ and transmit. The receiver measures w — v + z where w G R^ and 
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z represents transmission errors. The receiver in some way decodes the TV numbers, 
hoping to produce the m original entries in u. 

The nonzeros in z represent transmission errors; call the number of nonzeros fc. 
How many errors can such a scheme tolerate? In principle, if B is well-chosen and 
N — m — 2k+\, it is possible to correct k errors. To do so, the receiver executes a 
combinatorial search through all possible locations of the k nonzeros among the N 
received values, to see which error pattern is consistent with the underlying model. 
Unfortunately, such a brute-force scheme is impractical for all but the smallest N . 
More to the point is the question of how many errors a practical decoding scheme 
can tolerate. 

A simple decoding scheme based on (Pi) can be used if the encoding matrix B 
is generated in a specific way. Let [/ be a random orthogonal matrix, uniformly- 
distributed on 0{N), and partition it as C/ = where the encoding matrix B 

is mx N and the generalized checksum matrix A is nx N , with m + n = N. Given 
the received data w, form the generalized checksum y — Aw. Solve the instance of 
(Pi) given by {y,A), obtaining xi. The generahzed checksum is used to estimate 
the error pattern, and the optimization result xi is our estimate of z. Reconstruct 
by subtracting this estimate of the error out of the received message, and projecting 
down from to R™: ui ~ B{w — Xi). 

As (Pi) is a standard convex optimization problem, this can be considered com- 
putationally tractable. How many errors can this scheme tolerate? 

To answer this quantitatively, let us call R = n/N the rate of the code, and 
consider the regime of high-rate coding, where R is nearly one. In this regime we 
don't want to expand the block length by very much in our encoding, but we still 
want to gain some immunity to errors. 

The results just stated in Section [1.1.21 and a corollary in Section 6.4 below, 
together imply the following. Consider a sequence of problems {n,Nn) with P„ = 
n/Nn — > 1 sufficiently slowly. Suppose the error vector z contains k nonzeros and 
is stochastically independent of {A,B), so the sites and signs of the nonzeros are 
random and independent of A. There is perfect recovery ui = u provided k < k^ , 
where k^ is a random variable dependent on {A, B), and obeying 

(1.3) C = ^/(21og(l/(l-P„)))(l + 0p(l)), 71 -.ex. 

In short, if we use very long blocks, and stipulate a very small loss in transmission 
rate i?„ = 1 — e„, with e„ small, we can use linear programming to correct about 
n/21og(e„) errors. 

Results to be stated below - see Section 17.21 - imply an even more impressive 
result. Again, consider a sequence of problems {n, N^) with P„ — n/iV„ 1 
sufficiently slowly. Suppose the error vector z contains k nonzeros at arbitrary sites 
and with arbitrary nonzeros. There is perfect recovery ui = u provided k < k^, 
where k^ is a random variable dependent on {A, B) and obeying 

(1.4) A:f > n/(2elog(0F/(l-P„))(l + Op(l))), ti ^ oo. 

In short, if we use very long blocks, and stipulate a very small loss in transmission 
rate P„ = 1 — e„, with e„ small, we can use linear programming to correct all 
possible patterns of about n/2elog(e„) errors. 

Note that the sites and values of the errors can here be arbitrary; they can be 
chosen by a malicious opponent who knows v,B,A, and ul The noise can thus be 
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arbitrarily more energetic than the signal, can be carefully chosen, and still it is 
completely suppressed. In contrast to H1.3|l . which requires errors to be in random 
positions, (|1.4|l allows them to occur in bursts or in any other malicious patterns. 

1.2. Random Projections of Convex Polytopes. The surprises (|l.l|l . (|1.2|l . 
(|1.3(I . H1.4|I are facets of a phenomenon which makes appearances throughout the 
mathematical sciences, in the fields of statistics, probability, information theory, 
and signal processing. The phenomenon concern thresholds in the behavior of face 
counts of random high-dimensional polytopes. We now develop the terminology 
and framework for those results, only later explaining how they imply Hl.l|l - H1.4() . 

Let T = T^^^ denote the standard simplex {x : J^i^i — Ij^i — ^1 ^.nd let 
C — denote the standard cross-polytope in R^, i.e. the collection of vectors 
{x : \\x\\i < 1}. (Also called the ^^-ball). Here and in what follows, let Q be either 

Let A he an n X N random matrix with Gaussian iid entries. The image AQ is 
a convex subset of R"; in fact, a convex polytope. We are interested in the case 
n < N , so that multiplication by A lowers the dimension. 

It makes sense to count the number of fc-dimensional faces of Q and AQ, < 
k < n. In general AQ will have fewer faces than Q. More precisely, if we enumerate 
the fc- faces F of Q, each AF will either be a face of Q or will belong to the interior 
of AQ. More picturesquely, some of the faces of Q 'survive projection', while some 
of the faces 'do not survive'. 

1.2.1. Typical Faces of Random Polytopes. The fc-dimensional faces of Q make a 
finite set, Tk{Q) (say), by placing uniform measure on this set, we may speak of 
typical faces, as follows. 

Definition 1.0. Consider a sequence of problem sizes {n,Nn). Suppose that, for 
a given projector A, a property V — ViF-jA) of the projected face AF holds, 
at a fraction n^.n — '!^k,n{A) of fc-faces F e Tk{Q). Suppose that the random 
variable 7r„ — s-p 1 as rt — > cxd. Then we say that (asymptotically) the typical k-face 
F e Tk{Q) has property V . 

We now consider the fate of the typical fc-face of A under the projection Q ^ AQ. 
In the following statements, fix e > 0. 

• Let F be a typical k-face ofT'^"^. Is AF a face ofAT^'^ ? The answer is 
yes, provided N and k are both large and n > 2klog{N/n){l + e), and no 
provided n < 2klog{N/n){l — e). 

• Let F be a typical k-face of . Is AF a face of AC'^ ? The answer is 
yes, provided N and k are both large and n > 2k\og{N/n){l + e), and no 
provided n < 2klog{N/n){l — e). 

In short, there are well-defined thresholds at which typical k-faces of the sim- 
plex and the cross polytope begin to get 'swallowed up' under random lowering of 
dimension. 

1.2.2. All Faces of Random Polytopes. We now consider the fate of the whole col- 
lection of /c-faces simultaneously. 

• For every k-face F ofT'^^^, is AF also a k-face of AT^~^? The answer 
is overwhelmingly likely to be yes, provided N and k are both large and 
n > 2e/clog(7V/(n-2V^))(H-e). 
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• For every k-face F of , is AF also a fc-face of AC^ ? The answer is 
overwhelmingHy hkely to be yes, provided N and k are both large and 
n > 2ek\og{N/{n ■ V^))(l + e). 

Below certain specific bounds on the face dimension k, no faces are lost in projection. 

1.3. Background: Proportional Growth Setting. Our promised applications, 
such as and H1.2|l . were stated merely with n and N (respectively k and N) 
both large. However, the backbone of our analysis (and the bulk of prior scholarly 
work) concerns a setting in which {k,n,N) are large but also comparable in size. 
We consider this case first and later extend our results to a more general setting. 

Definition 1.1. A sequence of triples ((/e„, n, Nn) : n = uq, uq + I, . . .) will be said 
to grow proportionally if there are S € (0, 1) and p G (0, 1) so that 

(1.5) kn/n^p, n/Nn^S, n —+ oo. 

We omit subscripts n on k and N unless they are absolutely necessary. 

There are several significant prior results concerning thresholds for face counts 
in the proportional-growth setting. 

1.3.1. Weak Thresholds. Consider first the question whether the typical face sur- 
vives projection. 

• Simplex. There is a function : [0, 1] [0, 1] with the following property. 
In the proportional growth setting with p < p'^{6), we have 

£;/fc(Ar^-i) = A.(T^-i)(l-o(l)), 0<k<pn, n^oo; 

while if p > p^{S) we have that for some e > and some sequence (fc„) 
with fc„ < pn, 

EMAT^-')<MT''-')il-e), n^oo. 

Informally, the fraction of faces lost: 

(/fc(T^-^) - £;A(AT^-i))//fc(T^-i), 

is either negligible or non-negligible depending on which side of p^{S) the 
fraction k/n sits. In words, for fc„ somewhat below the threshold n ■ p^{S) 
the typical fc„-face of the simplex survives projection into n dimensions; 
but for kn somewhat above the threshold this is no longer true. 

• Cross-Polytope. There is a function p^ : [0, 1] [0, 1] with the following 
property. In the proportional growth setting with p < p^{S), we have 

Efk{AC'')=MC^){l-o{l)), 0<k<pn, n^oo; 

while if /9 > p^{S) we have for some e > and some sequence (fc„) with 
kn < pn, 

EMAC^)<fkiC''))il-e), n^oo. 

Again, for kn somewhat below the threshold n ■ p^{6) the typical fc„-face 
of the cross-polytope survives projection into n dimensions; but for some 
kn at or above the threshold this is no longer true. 
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In view of these results, the square < S, p < 1 may be decorated with a phase 
diagram. The two pvK-functions mark phase transitions; there are two phases for the 
property "the projected polytope has approximately as many faces as the original" . 
Below the transitions, the property holds asymptotically for large n, while above 
the transitions, the property fails asymptotically for large n. Both transitions are 
depicted in Figure ITtI which displays a phase diagram in (S, p) plane. In the region 
below these curves, typical faces are not lost, in the region above those curves, 
typical faces are lost. To interpret these curves, note that ii S = 1/2 so we are 
lowering dimension by 50%, and if n is large, then the typical fc-face of the simplex 
survives, for k/n < .5581, while the typical fc-face of the cross-polytope survives, 
for k/n< .3848. 

Vershik and Sporyshev '33' pioneered study of the proportional growth setting, 
and proved the existence of what we call here the weak threshold for the Simplex 
case. The weak threshold function p^ was introduced using our notation and 
carefully studied by the authors in |12| . where numerical methods were developed 
for its calculation and display. The weak threshold for the cross-polytope was 
introduced in JD], calculated, and displayed. 

1.3.2. Strong Thresholds. We now ask when the difference between fk{AQ) and 
fk{Q) is small in absolute, not relative, terms. 

• Simplex. There is a function p'g : [0, 1] i— > [0, 1] with the following property. 
In the proportional growth setting with p < p'g{S), we have 

S/fc(Ar^-i) = A(T^-i)-o(l), 0<k<pn, n^^. 

Thus, for fc below n ■ p1j{S) there are on average as many fc-faces of the 
projected simplex as the original simplex. On the other hand, if p > p^ ((5), 
then there is a sequence (fc„) with fc„ < np along which 

• Cross-Polytope. There is a function p^ : [0, 1] i-^ [0, 1] with the following 
property. In the proportional growth setting with p < p^{S), 

Efk{AC'') = fk{C'')-o{l), 0<k<pn, n-.oo. 

Thus, for fc somewhat below n ■ PgiS) there are on average just as many 
fc-faces of the projected cross-polytope as the standard cross-polytope. On 
the other hand, if p > p^((5), then there is fc„ < np with 

A(c^)-£;A(ac^)-.oo. 

The function pg was introduced and carefully studied by the authors in |12| , and 
numerical methods were developed for its calculation and display. The threshold 
function was introduced in calculated, and displayed. 

These strong thresholds have another interpretation. Consider the event "all 
low-dimensional faces survive projection" , i.e. 

nik, n, N) = {MAT^-') = fi{T^-'),i = 0, . . . , fc}. 

Simple arguments as in show that if p < Pg{S), the probability 

P{n{kn,n,N„))^l, n~^oo. 

Hence, below the strong phase transition, all low-dimensional faces survive pro- 
jection. Parallel arguments can be made in the cross-polytope case. Thus in the 
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region where k/n is below the corresponding ps function not only are very few k 
faces lost on average; actually, there is overwhelming probability that no faces are 
lost. 

These p-functions are depicted in Figure [TTTl The strong thresholds and 
fall below the corresponding weak thresholds p^, p^; indeed a property holding 
for every fc-face is less likely to hold than one holding for the typical fc-face. To 
interpret these curves, note that if (5 = 1/2 so we are lowering dimension by 50%, 
then every A:-face of the simplex survives, for k < .1335, while every fc-face of the 
cross-polytope survives, for fc < .0894. 




0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



Figure 1.1. Thresholds for S e (0,1) from top to bottom: 
p^{hlue - solid), p^(blue - dashed), pg{T:ed - solid), and pg(red - 
dashed). 

1.4. Main Results: Proportional Growth Setting. For applications, the range 
where S is small is very interesting; it corresponds to: 

• studying convex hulls of Gaussian point clouds where there are many points 
relative to the number of dimensions - Section [1.1.11 

• recovering a sparse signal from very few samples - Section ll.1.21 

• protecting against errors in digital transmission while sacrificing very little 
in the transmission rate - Section ll.1.31 

Previous work by the authors ^1 El considered the asymptotic behavior of 
the several p{6) functions just defined, and showed that p{6) > Ce log(l/(5)~^^' for 
each e > 0. Work by others 6, 28, 24 can be seen to imply that actually p{S) > 
clog(l/(5)^^. In this paper we determine the precise constants in the asymptotic 
behavior as (5 — > 0. These precise constants are important in applications; they can 
be used to plan how many samples to take in a digital imaging system or how much 
transmission rate sacrifice to make for a given error resistance. 

Theorem 1.2 (Weak Threshold - Simplex). 

(1.6) p+(<5)^|21og(<5)r\ S^O. 
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Theorem 1.3 (Strong Threshold - Simplex). 

(1.7) p+((5) - |2elog((52^^F)p^ S ^ 0. 

Comparing these results: 

• Note the leading factor e in ps- The highest dimension k where the vast 
majority of fc-faces survive projection is asymptotically e times higher than 
the dimension where we can guarantee that every fc-face survives. 

• An additional difference is the 2y/7T factor in the argument of the logarithm. 

Theorem 1.4 (Weak Threshold - Cross-polytope). 

(1.8) p±(J)^|21og(5)r\ S^O. 
Theorem 1.5 (Strong Threshold - Cross-polytope). 

(1.9) p|((5) - |2elog(5\/^)|"\ ,5^0. 

Comparing the cross-polytope results to those for the simplex: 

• Remarkably, to first order, the thresholds are the same for the simplex and 
cross-polytope. This is surprising since at moderate values of 6 the two 
functions are quite different; see Figure im 

• The bounds on strong thresholds agree, except for factors of 2 in the argu- 
ment of the logarithm. 

The weak-threshold asymptotic behavior H1.2|l and (|1.8|1 closely matches 
and p^ for modest values of (5 - see Figure 11.21 The strong-threshold asymptotic 
behavior, on the other hand, slowly approaches pg and pg from above - see Figure 

o 

Weak Thresholds and Asymptotic Behavior 
0.13 I 1 1 1 1 1 1 1 1 1 



0.07 




Figure 1.2. Weak thresholds (red), p^ (green), and their 
asymptotic behavior, |21og((S)|~ (blue), from Theorems 11.21 and 
O 5 e [10-3,10-2]. 
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Figure 1.3. Ratio of the strong thresholds and their asymptotic 
behaviors, PgiS) in blue and Pg{S) in red. The asymptotic formu- 
lae approach slowly from above by the factors shown. 

1.5. Beyond Proportional Growth. Having considered the Vershik-Sporyshev 
proportional growth scenario, we now generalize to the case where N can be dramat- 
ically larger than n. This is important for applications where we want to sample 
very few projections of a high dimensional object. [S| exhibits stylized imaging 
problems where an TV-pixel image can be reconstructed by asking n = 0{N°-) ques- 
tions, a < 1. This of course lies outside the reach of proportional growth and is 
dramatically smaller than N , underscoring the potential interest of the 'how many 
questions' problem of Section Fl . 1 . 21 where the number of questions n <^ N. 

We would naively hope that the same threshold functions p() "work" even outside 
the proportional growth setting. That is, in a setting where n/N^ — > , we would 
hope to get the 'right answer' for the behavior of face counts by simply 'plugging 
in' a varying S — Sn — n/Nn — > into the appropriate p- function. Happily, such 
naive hopes go unpunished. 

We say that N grows subexponentially relative to n if 

^^ "in\ AT / log(iV„) 

(1.10) Nn/n—^ oo, > 0, n^oo. 

n 

Theorem 1.6. Consider a sequence of problem sizes (n, N„) where Nn grows subex- 
ponentially relative to n. Let p be one of the four functions p^, p^, Pg ,Pg ■ Fix 
e > and consider a sequence (fc„) obeying kn/n < p{n/Nn){l—e) forn > Uq. Then 
the same statement that was made for that p in the proportional growth scenario 
holds in this non-proportional growth scenario. 

Thus, for example, fc„ < (1 — e)p'^{n/Nn) ■ n for n = no,no -\- 1, . . . implies 

EfkiAT^-') = A(r^-i)(l - 0(1)), n ^ oo; 

similarly, /c„ < (1 — e)p^(n/Nn) ■ n for n — hq, tiq + 1, . . . implies that with over- 
whelming probability for large n, 
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In short, the hmit relations of Theorems 1 1 . 21 1 . 5l are useful both in proportional 
and non-proportional growth settings. 

1.6. Contents. Our paper proves Theorems 11.211.61 The development is orga- 
nized as a branching tree, with initial sections mapping out the main concepts, 
propositions and lemmas, and later sections dealing with detailed estimates and 
proofs. 

Section 2 introduces the underlying machinery of face counting and an analytic 
approach to studying asymptotic behavior. Our starting point is a beautiful and 
essential formula for the expected number of faces of randomly projected polytopes, 
due to Affentranger and Schneider and Vershik and Sporyshev; it involves three 
factors, representing contributions from combinatorial aspects, from external angles 
and from internal angles. We focus on the exponential growth and/or decay of 
the factors by defining associated exponents '^{S,p); we work as if these factors 
behave exactly as exp{iV • Each pQ function is defined as the smallest root 
= ^'((5, p(S)) of an associated exponent function ^P, viewed as a function of p with 
S fixed. 

Section 3 gives the proofs for the lower bound half of Theorems 11.211.51 The 
proofs are simple consequences of the asymptotic behavior of the net exponents as 
a function of S and p in the regime where 6—^0. 

Section 4 develops the basic asymptotic analysis of the net exponents. The 
exponents in question explicitly involve tail probabilities of the Gaussian distribu- 
tion; our asymptotic analysis exploits detailed estimates for the Mills' ratio of the 
standard normal density. 

Section 5 turns to the proof of Theorem 11.61 going outside the proportional 
growth setting. Here we have to make careful estimates of the errors incurred by 
treating the pieces in the Affentranger-Schneider-Vershik-Sporyshev formula as if 
they grow exactly like exp{7V • 4"}. We refine our analysis associated with Mills' 
ratio, getting remainder estimates assuming 7V„ is subexponential in n. 

Sections 2-5 are preoccupied largely with proving only half of Theorems II . 211 
namely the bounds p(6) > ci / \og{c2 / S) . Section 6 gives the arguments establishing 
inequalities in the other direction, in the process completing the proofs of Theorems 

Section 7 shows how our face-counting results generate the applications men- 
tioned in Section 1.1. It also presents empirical results showing that our asymptotic 
results work at moderate sample sizes, and translates our asymptotic results into 
finite-sample bounds. It also considers extensions of this work, and compares our 
results with other recent work. 

2. Definitions of pt,Pw^Ps'Pw 

The various p quantities referred to in Theorems 11.211.51 have so far been dis- 
cussed behaviorally, by their role in locating or bounding phase transitions in face 
counts. In this section, we review an analytic definition for these quantities given 
in |1U[I12) . The definition unfortunately requires a considerable amount of machin- 
ery associated with convex integral geometry. Equipped with such machinery, the 
claims made by Theorems II .211 .'HI can be translated into sharply-defined questions 
about the leading-order asymptotics of certain exponents. Sections 3 and 6 answers 
those questions. 
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2.1. Expected Face Counts of Projected Polytopes. Let Q be a polytope 
in and A : i?^ i-^ i?" a random ortho-projection, uniformly distributed on 
the Grassmann manifold of all such projectors. Affentranger and Schneider ^ 
developed a useful identity for the expected number of faces of AQ : 

(2.1) Efk{AQ)^MQ)-2Y^' E /5(^,G)a(G,Q); 

here Tk{Q) denotes the set of fc-faces of Q, each is a subface of G, and Yl! 
denotes the sum over ^ = rt + 1, rt + 3, . . . ; < iV. We are intensely interested in 
the discrepancy between the expected number of faces of the projected polytope 
AQ and the necessarily larger number of faces of the original polytope Q; i.e. in 
knowing on average, how many faces are lost in the projection from to R". 
The discrepancy in question is 

(2.2) A{k,n,N;Q):^ fk{Q)-Efk{AQ) = 2Y^' ^ ^ l3{F,G)a{G,Q). 

Here the sum covers the external angles between the original polytope Q and its 
subfaces G, a(G, Q), multiphed by the sum of all internal angles between each 
particular subface G and its faces P{F,G). For definitions of these angles see 
eg. Griinbaum 18 , Chapter 14], or Matousek J25] . 

2.2. Analytic Definition of Pg,Ps- In the remainder of the paper we are always 
interested in just two choices of Q: the simplex, Q = T^~^, and the cross-polytope, 
Q = . Various quantities associated with the simplex case will be labeled with 
superscript + (as the interior of the standard simplex consists of positive vectors) 
and objects associated with the cross-polytope case will be labelled with superscript 
± (as the standard cross-polytope contains vectors with entries of both signs.) We 
frequently use * as a superscript in a statement which concerns either case, implying 
two different statements, with obvious substitutions. 

In the introduction, the functions p~g and were partially characterized by the 
claim that, for (fc„, n, Nn) growing proportionally and limit ratios (kn/n, n/Nn) 
(p, S) with p < Ps{5), then 

(2.3) A(fc„,n,Ar„;Q)->0, n ^ c5o. 

[Note: To make sure the reader follows our convention for the previous sentence 
is actually two sentences, one for the symbol binding (*, Q) = (+,T^~^) and one 
for the symbol binding (*, Q) = (±,G^).] It was also stated that ii p > p*g then 
for some sequence (fc„) obeying A:„ < pn, and some e > 0, 

(2.4) liminf A(fc„,n,7V„;Q) > e > 0. 

n — i-oo 

The papers ^3 E| actually defined p*g (6) with the following stronger property: 
if p < p^{5), then, in the proportional growth setting p.5|l for some e > and 
n > 7io(e,p), we have 

(2.5) A(/c„,n,iV„;g) < 7V„ exp(-7V„e). 

Those papers implied/stated without proof that if p > then for some sequence 
kn < pn, some e > 0, and no we have 



(2.6) 



A(/c„,n,iV„;(3) > exp(iVe), n > uq. 
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While conceptually, both (|2.3|l and (|2.4(l are equally important parts of the 
picture, in practice (|2.3|l is the more useful/surprising. Hence in Sections 3-5 of 
this paper we focus on supporting assertions like (|2.3|) and H2.5|l showing that the 
discrepancy is small, rather than than assertions like (|2.4() and (|2.6|l showing that 
the discrepancy is large. Section 6 will return to (|2.4() and (|2.6() . 

The analysis supporting the bound l|2.5(l for the unit simplex and cross-polytope 
went by first rewriting 1)2. 2|l as a sum of contributions due to faces of different 
dimensions: 

A(/c„,n,Ar„;Q) = ^'d,*„, 
I 

where 

E /3(J^,G)a(G,Q). 

The papers ^1E3 defined functions 7) for 7k- G {+, ±} associated with our 

two choices for Q; these can be used to bound as follows. 

Put vi^n — ^/Nn and 7f^„ = fc„/£, and note that vi^n G [<^, 1] and 7£,n G [0,p] 
over the relevant range I = n + l^n -\- 2, . . . \ I < N . In the proportional growth 
setting 10 showed that, for each e > 0, there is no{e;S,p) so that 

(2.7) 7V-Mog(i?,*„) < 1':,,K„,7,,„) + 3e, £ ^ n + l,n + 3, . . . , n>no, 

Since our focus is the condition H2.5|l . we of course are interested in conditions 
guaranteeing that the right side is negative, uniformly over the admissible domain 
of (j/, 7) pairs obeying ly > S, j < p. 

Definition 2.1. The maximal operator M[] associated to the family of rectangles 
where v E [(5,1], 7 G [0,p] takes a function and delivers the maximal 

function M[tp]{d, p) defined by 

M[V'](5,p) -sup{i/;(i/,7) -.lye [<5,l],7e [0,p]}. 

Applying this operator to each '^^et yields two maximal functions, A/[^'^gj] and 
^'^[^net]' to be studied extensively below. Finally we can give an analytic definition 
for the key quantities in Theorems 11.31 and 11.51 

Definition 2.2. For ★ e { + ,±}, define the strong phase transition pg((5) as the 
'first' zero of M[^'*gj]: 

pUS) = inf{p : M[KJ{6, p)^0,pe [0, 1]}. 

Definition 12.21 is depicted in Figure ITTl 

Several properties of the p^ are known from |10II12| . These functions are strictly 
positive on [0, 1], strictly increasing, with limit as 5 tends to and limits w .3679 
and .1685 (* = +, ± respectively) as S tends to 1. 

The functions Af [^E'^j^J are continuous. It follows that for p < p^{S), 

M[KJiS,p)<0. 

Setting e^\M[^lJ{S,p)\/i, 

(2.8) N-Hog{DlJ <-e, £ = n + l,n + 3, . . . . 
The result follows. 
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M[v;,|i(5,,p;,(s,)).o 







0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



Figure 2.1. Throughout the shaded region ^Xet ^ 0- Therefore 
the maximal function is < at the upper left corner ({5o, (^o)); in 
fact, M[*+,](5o,p) < Oforp < p+(^o), and M[vI/+J(5o, pj(^o)) = 
0; this is the "first" zero of M[]. The family of such first zeros of 
MfcK,^,^) define 

2.3. Analytic Definition of p^,p^- The papers and ^01 also defined phase 
transitions and p^. Conceptually, these quantities are defined by the notion 
that, for n, Nn) growing proportionally with limit ratios (fc„/n, n/Nn) [p, (5), 
then if p < p^^ {5) , the relative discrepancy is negligible 

(2.9) A(fc,n,7V;Q)//fc(g) = o(l), fc = 0, . . . , [pnj , n ^ oo, 

while for p > p^(i5) the relative discrepancy can be substantial; for some sequence 
(fc„) obeying fc„ < pn and some e > and uq, 

(2.10) A(fc, n, TV; Q)/fk{Q) > e > 0, n > no. 

Again while conceptually both 12.9|l and H2.10|l are equally important, practically 
speaking the former is more useful/significant than the latter, which mainly serves 
to show that we cannot substantially improve on (|2.9|l . We will focus on H2.9|l in 
Sections 3-5 and then return to discussion of H2.1()|l in Section 6. 
Define 

(2.11) */ace(^>7)=i^(^7), 

with -ff(-) the type-e Shannon entropy (|2.17|l . so that under proportional growth 
A-Mog/,(r^"i)-vI;+_(p,^). 

Also, put 

(2.12) *^_(^^, 7) = H{v^) + log,(2), 
so that under proportional growth 

iv-iiogA(c^)^*±_(p,5). 

Definition 2.3. For * e {+, ±}, define p^ ((5) as the 'first' zero of -^f^^ J: 

pI,{8) = inf{p : M\^l,, - ^)acM p)^^,P^ [0, 1]}. 
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If p < in the proportional growth setting, then for some e > we have 

(2.13) A(fc, n, N; Q)/fk{Q) < N exp(-iVe), n > no- 
This estabhshes (|2.9|l . 

2.4. Simplex exponent ^^gf. We now give more details about the exponent ^'^gj 
for the Simplex T^^^ . We begin with observations by Affentranger and Schneider 
PP and Vershik and Sporyshev [33], that: 

• There are (^^J A:-faces oiT^-^. 

• For i > k, there are (^^-sT^) ^"f^-ces of T^^^ containing a given k-iace of 

• The faces of T^^^ are all simplices, and the internal angle P{F,G) = 
j3{T^,T^), where T'^ denotes the standard d-simplex. 

Thus, for £ = n + 1, n + 3, . . . we can write 

= C+„•/3(T^T0•«(T^T^-l), 

with denoting the combinatorial prefactor. 

Each of the factors in this product has either exponential growth or decay. We 
will soon define associated exponents '^tonn'^tnt^ ''^^'^ '^txt so that, for any e > 
and n > no {S, p) , 

(2.14) log(C+J < ^tom{^,,n.ll,n) + C 



(2.15) 

and 

(2.16) 



log(a(r^ T^-i)) < -^ttM + e, 



uniformly in£ = n+ l,n + 3, ...;^<iV. 

The exponents were introduced in il2j, which showed H2.14|l - H2.16|l : we repeat the 
definitions, although the reader should not expect much insight at this point. The 
definitions are restated in Section^] equations H4.1|l . H4.5() . (|4.11() . and (|4.16() . where 
further details emerge. The combinatorial exponent involves the base-e Shannon 
entropy: 



(2.17) 
Thus, 
(2.18) 

The internal exponent is 



Hip) = plog(l/p) + (1 - p) log(l/(l - p)). 



(2.19) 



1 7—1 
log(y^/7) + - log(27r) + ^^y^. 



27 



Here is defined implicitly by 
1-7 



(2.20) 



7 



— Sj with R{sj) = 1 — 7, 
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where 

(2.21) R{s) := se*'/2 / ^-y'^^dy. 



The function R{s) is closely related to a fundamental tool for studying tail probabil- 
ities of the standard Normal distribution - the so-called Mills ' ratio of the Normal 
distribution, [531 Sec. 5.37] about which more will be said in later sections. The 
fact that R{s) ^ 1 as s — > oo signifies that the tail probability under the nor- 
mal distribution is asymptotic to times the normal density. Details of this 
approximation will be crucial for our work here. Finally, the external exponent is: 

(2.22) VI/+ ,(^.) ,,xl logQ(x,), 
with Xi, the solution of 

(2.23) ^ + l-.-=0; 

here q{x) := tt^^/^c^^ and Q{x) — J^^q{y)dy. Note that Q() is the normal 
distribution with mean zero and standard deviation 1 / V2, and so Xi, is again asso- 
ciated with the relationship between tail probabilities and density. This definition 
seems at first very similar to the definition of the internal angle; however, note that 
Q{x) ^ 1 as a; ^ oo, while q{x) rapidly. This difference is reflected in the 
behavior of the function of which is very different than the behavior of 

2/-y as a function of 7. 

These ^'-functions are all smooth functions of their arguments. For details on 
these exponents, see either the original source 221i where graphical displays are 
provided, or Section 0] below. 

It follows from ^J^-^J^ that for ^ = n + 1, n + 3, . . . , 

\0giD+J < ^+mi''e,n, ll,n) " ^tntivi,n: ll,n) " ^txM,n. ll,n) + 3e. 

Defining now 

provides us the desired property (|2.7|l referred to earlier, in the simplex case. Graphs 
were presented in |12| showing the exponent's behavior for 7 — .5555 over the range 
V g (.5555,1]. Software is available to make similar graphs for other parameter 
choices. 



2.5. ^'i^et? Cross-polytope case. Boroczky and Henk 1^ previously studied the 
expected number of faces for the randomly projected cross-polytope , and al- 
though the analysis is quite different, we utilize a number of their observations. 

• There are 2'=+i(j^^J fc-faces of C^. 

• For i > k, there are 2^^'^ £-faces of containing a given fc-face 
of C^. 

• The faces of are all simplices, and the internal angle (3{F, G) — (3{T^ ,T^). 

• The external angle a(G^, C^) is the same for all ^-faces of C^, the closed 
form expression of which was originally given in P). A version written in 
our notation was developed in JO!, and is spelled out below in H5.11|l . 



16 



DAVID L. DONOHO AND JARED TANNER 



Thus, for £ = n + l,rt + 3, ... we can write 

= C•±„•/3(T^T^).a(^^^0, 

with the combinatorial prefactor. 

The factors in this product again have either exponential growth or decay. We 
will soon define associated exponents ^'^^j ^tnt^ ^^'^ ^txt that, for any e > 
and n > 7^o(e), 

(2.24) log(C±J < *± „(z.,,„, 7,,„) + e. 



(2.25) 7V-llog(/?(^^T0) < ~^iM,n,li,n) + e, 
and 

(2.26) iV-i log(a(F^ C^)) < ^^f.tM + e, 
uniformly in i — n + l,n + 3, . . .; £ < N . It follows that for n > uq, 

iV-l l0g(i?±„) < '^tom{l'i,n,le,n) " "^tM.n, li,n) " ^f^til^i^n , iLn) + 3e. 

The exponents were introduced in 10 , which showed H2.24|l - H2.26|l : we rehearse 
the definitions, admitting they yield little insight at this point. The definitions are 
restated in Section^ where further information can be obtained. The combinatorial 
exponent again involves the base-e Shannon entropy H: 

(2.27) ■■= H{v) + vH{^) + v\og,{2)- 

thus ^^om — ^tom + f^loge(2). The internal exponent is actually the same as in 
the simplex case: ^f^t — ^tnt- Finally, the external exponent is: 

(2.28) 4-^,(1/) := vyl - (1 - z.)logG(y,), 
with tji, the solution to 

(2.29) ^ + 1 - = 0, 

a.nd g{y) :— 27r~^/^e~^^, G{y) = eif{y) = g{w)dw. G{) is the Error function, also 
called the Half-Normal distribution HN{0, i). Again the ^P's are smooth functions 
of their arguments. 
Defining now 

provides us, in the cross-polytope case, the property (|2.7|l referred to earlier. In 
[TU| it was shown that this is a well-defined and in fact nicely behaved quantity as 
a function of 7 for each fixed v. Graphs in |1L)| portray its behavior over the range 
ly G (.5555, 1] for 5 = .5555; software is available to compute similar graphs as other 
values for i/. 
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ASYMPTOTICS OF AS S 







We now turn to the asymptotics at the heart of Theorems 11.211.51 As indi- 
cated earher, in Sections 3-5 we focus on estabhshing lower bounds on p-functions, 
practicaUy most 'important' or 'surprising' part of our results. 

We introduce a parametrized family of simple comparison functions r* (S) of the 
form |Tlog(c • d)\^^ and control the behavior of p*{S) by studying the maximal 
functions along the trajectories {6,r*{S)) as 6 ^ 0. The central point will be that 
for r > To, each associated maximal functions is asymptotically negative along the 
trajectory {S,r*{S)). This forces p*{6) > r*{S). 

We will also glean insights useful for establishing upper bounds on p-functions. 
It will emerge that fixing r > tq defines a trajectory along which the net exponents 
are asymptotically positive and that fixing t — tq defines a trajectory such that 
the difference between net and face exponents is vanishing; it will be explained in 
Section 6 how this implies the upper bound half of Theorems 11.211.51 

It is convenient to develop the results in a permuted order. 

3.1. Theorem [Ql Fix r > 2e and define r+{d) r+{6;T) |t log((520F)|^i. 
In what follows, r is always held fixed throughout an argument, while S is sent 
towards 0. 

We intend to show that there is Sg = ^^{t) > so that 



(3.1) 



M[^teMr+{6))<0, 



0<6 <6^. 



This establishes the lower-bound half of Theorem lTlH i.e. that p'^{S) > |2elog((52-y7r)|" 
(1 + o(l)). The other half of Theorem 1 1 . 31 can be inferred from the fact that if we 
instead have r < 2e there is So = So{t) > with 

vI/+,(<5,r+(<5))>0, Q<S<6o. 

See further discussion in Section 6 below. 

We start the proof of H3.1(l by observing that the maximal operator M[] be- 
comes 'transparent' in the limit 5 ^ if we stay along the trajectory {S, r^(i5; r)). 
Corollarv l4. II below shows that, if r > 2e, for some Si — Si{t) > 0, 

(3.2) ^+,,iS,r+i6))^M[^LMrtiS)), < S < S^. 

The following limiting behavior of the individual exponents as S 
p^O wiU be derived in Section gl see ^^T^: 



and/or 



{3.3)^tomiS,p) = 
(3.4) = 



(3.5) ^t,{S) = S 



S[log{l/S) + l + 0{S\/ plogp)], max{p,S) - 

O(plogp)], p^O, 
log log 



0, 



-^'5[logp + log(e/27r 



^ogzj - - log log z+ 



l + O 



log z+ 



here :— {S2y/n) ^, 0{x) denotes a term bounded by Const - \x\ for all sufficiently 
small I a; I, and x\/ y is the maximum of x and y. 



From 4' I' 



6 



M[^+J{S,p) 



com 

^ 4 



tnt 



\ogp- 



we have, with p 
log log z+ -f log(2e) + O ( (5 V p log p V 



log log 

log 2/ 



(3.6) 



= S 



log 



2e 



O 



log log z, 
logzj 



0. 
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The 0{) term tends to zero with S. Now t > 2e so log(2e/r) < 0; for some 
(52(t) > the bracketed term stays negative on < (5 < (52(t). Ij^.H! follows with 
,5+(t) =min(,5i(T),^2(T)). □ 

3.2. Theorem 11.21 With H3.6|) in hand, it is now convenient to prove the lower 
bound in Theorem 1 1.21 

Fix r > 2 and define r^{S) r^((5;r) := [r log(l/(5)]^^. We wiU show that 
there is 6^ = S^{t) > so that 

(3.7) M[*+,-*+ J(5,r+(<5))<0, < S < S+. 

Below, Corollary 14.31 shows that the maximal function becomes 'transparent' - 
namely that, fixing r > 2, there is Si = Si{t) > so that 

(3.8) M[net-'^%ce]iS,rMS)) = i'i't,-'i'%JiS,rUS)), for S<Si. 

Recall (|3.3ll - (|3.5ll and 12.11|l . From 14.4(1 as the asymptotics for 12.11|l we obtain 
the following display, in which p = r^{S): 

Af , - (<5, p) = 4 [log p-2p log(l/<5) + log fog z+ + fog(2e) 



- facel \ T 2 

+0 I SV plogpV 



(3.9) = 6^ 



log log z| 



Since log(l + x) < x for x G (— l,oo), by setting 1 + x = 2/t we see that r > 2 
implies log(2e/T) — 2/t < 0. Hence there is S2{t) > so that the term in brackets is 



negative for S sufficiently small. Define now S^{t) — min((5i(T), S2{t)), establishing 



Looking further ahead to proving the upper bound half of the theorem, we record 
the following remark. Fix t = 2. Then as (5 0, 

(3.10) (*+,-vl/+_)(5,r+(J))^0. 

The implications will emerge in Section 6. □ 

3.3. Theorem ll.SL The proof of this lower bound is structurally analogous to the 
proof of the lower bound in Theorem ll.3l 

Fix T > 2e, and define rf{d) :— rg{S;T) \Tlog{S^)\^^ . As in the proof of 
Theorem II. 31 we will show there is Sg = (t) > so that 

(3.11) M[^tMrU6))<0, 0<S<6l 

This establishes half of Theorem 11.51 Again, the other half can be inferred from 
the fact that for r < 2e there is Sq = So{t) > with 

^tM4{d))>o, o<d<So. 

Section 6 will give the details. 

CoroUarv 14.21 below shows that the maximal operator M[] becomes transparent 
in the limit 5^0; for some Si = 5i(t) > 0, 

(3.12) ^tt{S,r${S))=M[^tMr${S)), < 6 < di. 
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The following limiting behavior of the individual exponents as i5 — > and/or 
p will be derived in Sectional see ijHSl), and ^J^: 

(3.13) *±_(<5,p) =<5[log(l/<5) + l + log,(2) 



(3.14) vi/±,(J,p)=- 

(3.15) ^t,,{S) 
where := (Sy/n)^^. 



[logp + log(e/27r) 



-©(^Vplogp)], max(p,^)^0, 
Oiplogp)], p^O, 



l + O 



log log 
log zf 



0; 



Combining asymptotics using ^^^^ = ^f^„^ - - ^f^t yields, with p = rf{S), 



M[^teMp) 

(3.16) 



log p + log log zf + log(2e) + O U V p log p V 



log log 



log ( — 



O 



log log 
log zf 



As log(2e/r) < 0, there is (52 (t) > so the term in brackets is negative for Q < 5 < 



52{t). Setting (5f (r) = uAiy{6i{t) , 62{t)) , (Tmil follows. 



□ 



3.4. Theorem ll.4l Structurally, the argument for this lower bound resembles that 
in the proof of Theorem 11.21 in the same way as the proof of the lower bound in 
Theorem II .51 resembles that in Theorem II .31 

Fix T > 2 and define r^{6) := r^(^;T) [t log(l/5)]-i. Note that r+ (5) = 
r^{S), unlike the strong threshold comparison functions rg{S) and rg{S), which 
are not equal. We will show that for — S^{t) > 0, 



(3.17) 



Corollarv l4 . 41 shows that the maximal function machinery again simplifies for small 
S. Thus for T > 2 and for 6i = (5i(t) > 0, 



(3.18) AfivE-i^-M/^ J(5,r±(<5)) = ( 



face 



){S,r^{5)), 0<S<5i 



Recall (|3.16ll and H2.12() with asymptotic behavior following from (|4.4|l . We have 
the following display, in which p = r^{S), 



M[^tt - */aJ('5>P) = S- [logp- 2plog(l/5) +loglogz+ + log(2e) 



(3.19) 



4 



+0 { dV plogpV 
log ( - ) - - + 0(1) 



log log 

logz+ 



As in the proof of the lower bound for Theorem 11.21 for each t > 2 there is 
(52(t) > so that the term in brackets is negative for all S £ {0,62)- Setting 
S^{t) = min((5i(T), (52(t)), (TnTIl follows. □ 



4. Analysis of the Exponents 
We now verify earlier claims about the asymptotic behavior of the exponents. 
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4.1. Combinatorial exponents, '^*om- The combinatorial exponents for the sim- 
plex and cross-polytope were defined in H2.18|l and H2.27|l respectively; they obey 

(4.1) = ^tomi'^,!) - ^^log,(2) := H{,.) + i.if(7), 

where again H{p) = plog(l/p) + (1 — p)log(l/(l — p)) is the (base-e) Shannon 
entropy. Both identities in (|4.1|l derive from the limit 

/ n 



(4.2) 



n ^ log 



\[pn\ 



H{p), oo, p e [0, 1], 



which of course is fundamental in asymptotic analysis and has proven useful in 
earlier research concerning polytopes ^33. ^ .lOj . The asymptotic behavior 

(4.3) *+„(i/,7) = i'[log(l/j^) + l + 0(z^V7log7)], V 7 ^ 0, 
follows directly from that of the Shannon entropy, 

(4.4) Hip)^p\og{l/p)+p + Oip^), p-^O. 

4.2. Internal exponents, The internal-angle exponent is the same for both 

* = + and * = ±; it was defined in l|2.19|l by 



(4.5) 



1 



log(y^/7) + - log(27r) 



7-1 2 
27 



where y-y was defined implicitly by 

1 - 7 

(4.6) = s^, 

7 



and s-y solves R{s^) = 1 — 7; 



here R{s) - defined at (|2.21|l - is closely associated to a famous quantity in prob- 
ability theory, the Mills' ratio of the standard Normal distribution j^Hl Sec 5.38]. 
The asymptotic properties of s-^ as 7 — > (and hence also of y-y) were studied in 
jlO| using properties of Laplace's asymptotic series for R. In the Appendix, we 
refine that approach, obtaining the following error bounds. 



Lemma 4.1. 

(4.7) y^ 



+ r2(7), with |r2(7)| < 47^/2 for 7 < 1/30. 



The behavior H3.4|l of the internal exponent as 7 — > follows from this lemma 
directly. Indeed, substitute the behavior of y^ given by Lemma |4. II and rearrange 
terms; 



KI-7) 



i.(l-7); 



1 7 - 1 

log(2/7/7) + t: log(27r) + — — y. 



27 



- log7 + log(27r) 



1-7 

2 ( log(l - 7) + 7~'/V2(7) + ^^^2(7) + log ( 1 + 



^-2(7) 
7(1 - 7) 



^(1-7)2 hlog7 + log(27r/e) + 0(7)], 7^0 (by gTl) 



(4.8) 



-2'^[^°S7 + log(e/27r) + 0(7log7)] , 7 0; 
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this is (jnHIl. □ 

4.3. External exponents, ^'g^.j. Each external exponent is defined implicitly 
through a relation qualitatively resembling 

(4.9) /(a;(z),z) =xe^' - z = 0; 

that is to say, we will soon be interested in quantities resembling the solution x{z). 
We briefly sketch an analysis technique for such quantities. 

Our approach approximates the asymptotic behavior of x{z) for z large by 



X2{z) = ^^\ogz~ i log log z; 

the approximation error obeys 

I / X / M log log z 

\x[z) — X2[z)\ < as z ^ oo. 

21ogz 

The subscript 2 signals that X2(z) is the second in a sequence of approximations. 
The sequence starts from a very crude approximation, xi{z), and then improves 
with each stage. The initial approximation, xi := -^/logz, is obtained by treating 
the factor x in H4.9|l as if it were constant, so that instead of solving H4.9|) . we simply 
solve 

el = z. 

This approximation, substituted into equation (|4.9(l . yields an error 

(4.10) /(xi,z) = z((logz)i/2-l). 

The next approximation, X2, comes from attempting to cancel the (log 2)^/^ factor 
in the above error. This is done by solving 

e"' = z(logz)-i/2, 

which indeed yields X2{z). This sequence continues on to increasingly accurate 
approximations, but we stop here because the second term is sufflciently accurate 
for our purposes. 

4.3.1. Simplex case "^^r^f Recall the definition given in l|2.22|l : 

(4.11) 4'+,(i.) ■.^uxl-{l-u)\ogQ{x,), 
where x^ solves 

(4.12) ^ + l-.-=0; 

here q{x) :— n^^l'^e^^ and Q{x) — Q{y)dy is related to the Error function by 
Q{x) — 2(l+erf(x)). Since there is no closed form solution to Q{x) = c as a function 
of c, to analyze the implicitly defined x^, we develop an asymptotic approximation 
using the technique just sketched. Define 

1/2 

(4.13) z+ = z+(iy) := {i'2^)-\ 



log ^ 2 



In the Appendix, we prove the approximation result: 
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Lemma 4.2. There is uq > so that 
= + r3{v), Xi, := 



log - i^^^oglogz"' 



-,1/2 



1 



1 



rs{v)\ < - logz+--loglogz 



-1/2 



log log 
logz+ 



< ly < vq. 



We now plug this approximation into l|4.11|l . and derive the asymptotic behavior. 
As the cumulative distribution of normal Q() famously has no known closed form 
expression, we approximate Q{x) for large x using the asymptotic series [23 Sec. 
5.38], 



Qix) 



e-^' ^ {r- 1/2)1 



2V^x^ {-x^Y 

^ r—Q ^ ' 

Keeping the first two terms, and applying bounds from j2HI eq (5.109)], we have 

1 



Q[x) = 1 



+ 0(x e ), x^ CO. 



Recalling igTTSll, 

we now substitute the approximation to x^ from Lemma l4.2l note 
that = x^ ^ r^{y) with |r4(z^)| < 2(loglogz+)/logz+, and z+ as in H4.13|l . 
Hence, 

Q{Xy) 



= Q(xv + r'i{v)) 



= \-v 



1 - 



log log 



-1/2 



21ogz+ 
log log z+ 



logz^ 



• [1 + r3(z/)/i,]-i e-'-^^") + 0(i//logz+), v 
0, 



l + O 



log log z+ 

log z+ 



logz"*" — - log log z+ + 1 + O 



i/-> 0. 



log logz^ 
log 0+ 



from which follows 

(4.14) logg(a; 
We obtain, finally, 

(4.15) ^t^,(y) = V 
This is (E3J). 

4.3.2. Cross-polytope case: '^f^t- The definition given in H2.28|l was 

(4.16) ^t.ti'^) ,yyl - (1 - ,y)logG{y,), 
with yi, the solution of 

2yG(2/) 



0. 



□ 



(4.17) 



9{y) 



+ 1 - J^"^ = 0, 



where we recall from before g{y) — 27r 



on y > 0, and G'(y) = erf(y) 



g{'w)dw is the Error function. The procedure just used in Section 14.3.11 also 
works here. We merely state results, omitting proofs. 

1/2 

Let z"^ — z^{v) := (ly^)^^, and set y^, — [logz''^ — ^ loglogz''^] 
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Lemma 4.3. There is > so that 

1-1 log log 

This approximation is motivated by the asymptotic series of 2yG{y) / g{y), giving 
y^ey" ^7T-^/^,y-' =0{y-^). 

The series is identical to the series motivating x^, in ^'j^j but now z^{i') = := 
The precise bound on the remainder, r^ii^), can be recovered by following 
the same steps as in the proof of Lemma 14.21 replacing J{x,v) in that proof by 
2ye-y^ e-""^ dw + 



□ 



The asymptotic behavior of the external exponent, 
(4.18) vl/±^(i.) = 



± logz±-iloglogz± + l + 0^^°S^°S^^ 



2 ° ° V log^^ 

follows by substituting y^, as justified by Lemma 14.31 

4.4. MELximal Function for We now support our earlier claim (|3.2|) that 

^"^[^net] — ^riet ^loug the trajcctory {S,rg{S)) for S small enough. 

Corollary 4.1. Fix r > 2e, and recall the definition (5) :— rg{d;T) :— |Tlog((52-\/7r)|~-'^. 
There is Si(t) > so that 

K.et{S,rt{S)) = M[*+ J(5,r+(5)), < <5 < S,{t). 

This follows from two lemmas, proved in the Appendix, which clarify how "^net 
changes with v in the regime of interest. 

Lemma 4.4. Fix r > 2e. There is 6i = 6i{t) > so that for < 6 < Si, and 
< 7 < rj(5), 4'^g((i/, 7) is a decreasing function of v for v £ [S, 1). 

Lemma 4.5. For < 7 < 70, 5'^gf(z^, 7) is an increasing function of j. 

Similar results hold for the cross-polytope [note the slight difference in definition 
between rg{S) and r^{6)]. 

Corollary 4.2. Pick r > 2e and again set r^{S) := rg{S;T) := \Tlog{Sy/Tr)\~^ . For 
S < So{t), 7) obtains its maximum value over v G [5, 1) and 7 < rf{6) at 

(^,7) = (<5,r|(J)): 

^tetiS,r${S)) = M[^ij{6,r${6)). 
We omit the proof, whose arguments parallel those for Lemmas 14 . 41 and 14 . 51 



4.5. MsLximal Function for ^*iet~^/ace- We now consider the maximal function 
associated with the weak exponent, establishing the earlier claim H3.8|l . 

Corollary 4.3. Fix t > 2. There is Si = Si{t) > so that 

This follows immediately from the next lemmas, which are proven in the Appen- 
dix. 

Lemma 4.6. Fix t > 2. For < S < Si{t), < 7 < r^{S), 

(^net ~ ^/ace)('^' '>') decreasing function of v over v G [(5, 1). 
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Lemma 4.7. Fix t > 2. For S e (0,(5i(t)), p e {0,r^{S)) and v e [5, 1), {^X^t - 
^/ace)('^'7) '^"^ increasing junction 0/7 , < 7 < r^((5). 

Similar results for the cross-polytope are obtained by following the same ar- 
guments line-by-line with appropriate substitutions. One obtains the following, 
though we omit the argument. 

Corollary 4.4. Fix r > 2. There is (5i = (5i(r) > so that 

(*net - */ace)('5,'^^('5)) = M\^t.t " */ac J (-5, , < <5 < ^ i . 

5. Beyond Proportional Growth 
Theorem II .61 can be reformulated as follows. 

Theorem 5.1. Let Nn grow subexponentially with n. 

• Strong Exponents. Fix r > 2e and consider a sequence (kn) with kn < 
n ■ r*g{n/Nn]T). There is a sequence (e„) with NnEn — » oo and 

(5.1) 7V-Mog(D*„) < -e„, i = n+l,n + i,.... 

• Weak Exponents. Fix r > 2 and consider a sequence {kn) with kn < n ■ 
r^lnlNn]T). There is a sequence (e^) with NnCn — > oo and 

(5.2) iV-i(log(i^,*J-log/fe(g))<-e„, £ = n+l,n + 3,.... 

To venture outside the proportional growth setting requires to strengthen all 
previous arguments. First, we have to show not just that each maximal function 
is negative before its first zero, but that it is sufhciently negative in a quantita- 
tive sense. Fortunately, the hard work has already been done; summarizing the 
implications of (|3.()|l . (|3.9|) . H3.16|l . and (I3.19|l . we have: 

Lemma 5.2. Let (5„ — n/Nn- 

• Strong Exponents. Fix t > 2e. There are Cg{T) > so that for n > uq 

(5.3) M[KJiSn,rUSn))<~Csir)Sn. 

• Weak Exponents. Fix t > 2. There are Cwi''') > so that for n > no 

MiKet - '^}ace]{Sn,rty{Sn)) < 'Cwi^jSn- 

We must also strengthen the previously-discussed inequalities (|2.14ll . (|2.15|) . (|2.1(j|l . (|2.24|l . (|2.25|) . 
and H2.26|l . giving precise information about the remainders. We start with the 
combinatorial exponent. 

Lemma 5.3. 

(5.4) N-'logCl, < ^:,„,{,^i,n,lLn) + 0{N-Hog{N)), 
where the 0{) term is uniform in £ — n + l,n + 3, . . . . 

The proof is given in Section l5. II We next consider the external angles. 
Lemma 5.4. 

(5.5) 7V-lloga(^^T^-l) <-vI/+,(i.,,„) + 0(7V-ilog(7V)), 
where the 0{) is uniform in i — n + l,n + 3, . . . . Similarly, 

(5.6) 7V-lloga(^^^C•^) <-v|/±,(z.,,„)-)-0(7V-ilog7V), 
where the 0{) is uniform in £ — n + l,n + 3, . . . . 
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For the proof see Section [5?^ We finally consider the internal angles. 
Lemma 5.5. Fix r > 2e. 

(5.7) 7V-llog/3(T^^O < ~^*^M.n,li,n) + o{l)Keti''Ln,li,n) + 0{N-HogN), 

where the o() is uniform in i — n l^n + 3, . . . and in k — 1, . . . , [n ■ rg[Sn)\ ■ Fix 

T>2. 

N'Hog(3{T\T') < -vl/*„,(z.,,„,7,.„) 

(5.8) +0(l)(**,,(l.,,„,7£.„)-*}aceK«,7£,n))+0(iV-M0g7V), 

where the o() is uniform in £ ~ n + l,n + 3, . . . and in k — 1, . . . , [n ■ r'^{Sn)\ ■ 

We also need analogous results for the number of faces of T^^^ and . 
Lemma 5.6. 

log /fe(T^-i) < 7) + 0{N-^ log(iV)), 

where the 0{) is uniform m — 1, 2, . . . , n. Similarly, 

-7V-ilog/,(C^) < vI/±_(t.,7) + 0(iV-ilog(iV)), 
where the 0{) is uniform in k — 1, 2, . . . , n. 
For the proof see Section 15.41 

These Lemmas easily combine to finish the argument for Theorem 15.11 Under 
the subexponential growth assumption log(iV„) = o{n), the remainder terms 

0{N-^ log(iV„)) = o(n/7V„) = o{6n). 

Hence the remainders are much smaller than the bounds on Af[] terms associated 
with (|5.2(l . H5.3(l . Consider the case of the strong exponent for the cross-polytope. 
Uniformly in^ = rt + l,n + 3,..., 

N-'\og{Dfj < M[vI/±J(5„,r|(5„)).(l + o(l)) + 0(7V,7ilog(iV„)) 

< -C|(t) • 5„ • (1 + o(l)) + 0{N-' log(7V„)) 

< -(C|('r)/2) • (5„, n>no. 

Hence ()5.1ll follows, with 7k- = ± and e„ = (C5 (t)/2)(5„. The rest of Theorem 15.11 
follows similarly. □ 
It remains to prove Lemmas 15.31 - This we do in the coming subsections. 

5.1. Combinatorial exponents. Stirling's formula provides error bounds for the 
combinatorial exponents. 

Lemma 5.7 (Stirling's inequality, j2|). 

(2^n)i/2(^)"<n!<^(2^n)i/2(^)" for n>l. 
To verify (|5.4II . recall the combinatorial factors 

and that n — £/N and 7^ „ = k/£. 
Using Lemma I5. 71 we arrive at 

(5.9) c*^ < A .7Ve^*:o™(-^7)^ 
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establishing H5.4|l . 
5.2. External Angle. 

5.2.1. Simplex case '^txt- It is enough to show that uniformly over I — n + l,n + 
3,..., 

(5.10) 7V-iloga(T^T^-i) < ,(z.,,„) + 7V-i log (iV) ; 

of course, the remainder term is 0(log(A'^)/A^). The simplex part of Lemma 15.51 
follows. 

The external angle for the simplex is given by 

(5.11) a(^^T^-l) = yZIl (^-^ |' g-^^dy) """''^x. 

As before, Q{x) :— n^^/^ J^^e^^^ dy. Recall that z/f_„ — £/N and rewrite the 
simplex external angle as 



(5.12) a(^^T^-l) = xl^- I exp(-7VK„x^ + (i.,,„ - 1) log Q(x)])^dx. 







The factor N in the integral might suggest the use of Laplace's method as in |l()j . 
A simpler, direct approach is possible. The following is obvious but very useful. 

Lemma 5.8. Let ijj : [0,oo) i-^ R achieve its global minimum at x* and let ip : 

[0,cxd) i-^- [0,oo) be integrable. Then 

POO poo 

(5.13) / exp{—N^I;{x))ip{x)dx < exp{—Nip{x*)) / Lp{x)dx. 

Jo Jo 

Recall that x^, is the minimizer of [vx'^ + {v ~ 1) logQ{x)], and 

■■=i^xl + ii.~l)logQix,). 

Apply Lemma to the integral (|5.12l) : set ^"(2;) — + (^^ ^ l)logQ(a;)] and 
ip{x) = e~^^ /Q{x). Because ip{x*) = ^txti^) and J ip = (37r)/8, lf5Tl|l yields 



(5.14) a{T', T^-i) < e-^*^-M . . / i±l . ^ < ^aTTT • e-^*-^''). 

V TT 8 

(jFlUjl follows. □ 

5.2.2. Cross-Polytope case ^'^^.j. Our goal is to prove (|5.6|) . We introduce a per- 
turbed version of vi^n] 

1 

Note that vi^n € [0,1) as is the unperturbed vi^n- This is used in our first step, 
where we find that it appears naturally in the bound 

(5.15) iV^Moga(F^C^) < -*±,(%0 + 7V-ilog(7V), TV > 3. 

Note that the remainder is 0{N^^ log(A^)) uniformly over £ e [n+ 1, — 1], as our 
goal requires. Indeed (|5.6|l is an inequality like H5.15|l but with rather than v^^n- 
In our second step, we verify that the perturbation of the argument is unimportant: 

(5.16) '^tA^i,n) = "^ixtM + 0{N-^ log(7V)), 
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uniformly over £ ^ n + 1, n + 3, . . .; £ < N . The cross-polytope half of Lemma l5?5l 
then follows. It remains to show (|5.15(l - (|5.16|l . 

The external angle for the cross-polytope is given by 

N-e-i 



(5.17) 



£+1 



dw 



dy. 



Following the same approach as for the simplex, recall G{y) := 2tt e dw 

and rewrite the cross-polytope external angle as 

(5.18) 

/ ,2 \ 1/2 



£+1 



eM-N[vi,ny^ + ive,n - 1) logG(y)]) 



G(y) 



dy. 



Let Tpiy; v) = vy^ + [v — \) \ogG{y)], and yi, be the minimizer of ■)/)(•; v). Set 

^^tA^) ■■= vyl + - 1) logG(y,). 

Apply Lemma l5^ to the integral H5.18|l : set ij) — ^'{■,'^i,n), and ip{y) — (^fj(^) 
The factor exp(— 7V-0(a;*)) = exp i-N^'t^^{^)i^n)), while Jip< 2.175; we obtain 



1/2 



(5.19) 
Hence 



c,(F^C^) < -V£TTeM~N^t.ti^i,n)), 
N~' log a{F',C^) < -^t,M + N-Hog(^^V£TT^ , 



from which H5.15|l follows. 

We earlier studied the asymptotic behavior of ^'^((i/); see (|4.18|) . The effect of 
the perturbation 1/2N in Vg^n can be bounded simply. Put = (;?£.„). Then 

'log log 



log Zn 



log^„ - - log log Z„ + 1 



+ logz^ V 



± log log z± 



log Zn 



Our goal (|5.16(l follows. Combined with (|5.15(l we obtain (|5.6ll . 



/N, n oo. 
□ 



5.3. Internal angle. We aim to demonstrate (|5.7|l . We again introduce perturbed 
variables: 

£+2 , , fc+1 



N 



Our plan is to first show that for n > no 
(5.20) iV"i log /?(T^TO <-^,„t(z>,7)+A^"'log 



£ + 2 
2 



(TV + 3)5/2 



The remainder here is 0{N ^log(A^)). We then show that the perturbation of 
variables has a negligible impact: 



(5.21) 



uniformly 'm < k < n ■ rg{5n). Our goal (|5.7(l follows. It remains to prove H5.2()|l . 
(ESI- 
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An expression for the internal angle was developed in 



(5.22) 



1/2 



2'=-V+M/„(0); 



here gx+Wrrt (0) denotes the probability density of a certain random variable ex- 
pressible as a sum of m + 1 independent random variables; here m = £ — k + 1. 
[TP] used large deviations analysis to bound this term using a certain nonnegative 
convex rate function A* : [0,oo) >—>■ [0,C!o); the bound was: 



9T+w,^ (0) < 



Z m 



/2/t: 



y exp I —TO 







(g)y^ + A*(2/)])dy + 



say, with to = £ — fc + 1 and 9 = fc + 1. The second term was argued to be negligible 
in the proportional growth setting by soft analysis; later below we will check that 
it is still negligible in the current non-proportional growth setting. 

Focusing on the supposedly dominant term /,„, substitute in the values for to 
and 6, and rccaU that 7 = (fc + + 2): 



1 [l-k + lf 



/2/tt 



m (fc-hi) 

The integral here can be rewritten as: 



2/ exp -(f - fc + 1) 



7 / 2 



^+A*(2/) 



dy. 



J 11 



yexp(-7Vi>(l - j)^^iy))dy. 



where, consistent with earlier definitions, 

'1-7A 



7 



A*(y) 



Note that is the minimum of ^7(2/)- Again apply Lemma 15.81 to bound Jm', 
setting -0 = i>(l - 7)^7(2/) and ip = J/ljo^^/^j, (ESI) gives 

J„ < exp(-iV£>(l - 7)C^(2/^))/7r. 

Note that 

«'»„t(i>,7) := '^(l - 7)^7(2/7) + log 2], 

and so 

2'=-^-! exp(-iVz^(l - 7)^7(^7)) = expi-N^,^t{i>, 7))- 

Noting the presence of a factor 2^^^ in H5.22|l and noting that ^ -I- 1 — fc < iV, we 
obtain 

(3{T^,T^) < 2 



(5.23) 



< 2 (iV + 2)'/' • exp(-iV*„t (i^, 7)) -f + ■ 2^-' II, 



This essentially verifies (|5.20f) . 

However, it remains to verify that //,„ <C Im- Put /i — \f2pK and recall from ^HI 
that \x = £'(T -I- M^m). We focus any ^ [x— \p2fK and use the fact that the large 
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deviations rate function always vanishes at the underlying mean, i.e. A*(/x) = 
essentially by definition. Then 



It follows that 

//„ - cxp(-iV • u{l - 7)^7 (Ai)) 
But by definition of yx^ as the minimizer of and the asymptotic yx^ — s- 0, 

in fact ^^{n) ^ 7~V^ > log(7"^) ~ ^7(2/7) as 7 < r^((5„)(l + o(l)) 0. Hence 
Ilm is exponentially smaller than J„j, and H5.2U|I is fully proven. 
As for l|5.21|l . recall that 



(5.24) 
while, if /9 = I'si^) 



'frnt{i>,l) = - 2^ [log7 + log(e(l + k-^)/2n) + o(l)] 



log 



+ log(7/)+o(l) 



Look now in the vicinity of fc = "fn, where 7 = ?7 ■ r^((5„). 



2s +log(r;) + 0(1)1 



o(l). 



Here all the o(l)'s are uniform in < r/ < 1. 

The argument for (j5.8|l is similar to that of H5.7|l detailed above, replacing ^ 
by - 



□ 



5.4. Face Counts of ^ and . The number of fc— faces for the simplex and 
cross-polytope are 



fkiT 



N 
k + 1 



N 
k + 1 



and /fe(C~) = 2'=+i 

Invoking Lemma I^TtI and recalling that i/£.„ f/A^ and 7^^„ = we arrive at 

2 



(5.25) 
and 



25 V TT 



(5.26) 
establishing Lemma 



25 V TT 
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6. Upper Bounds on Phase Transitions 

Until this point, we have focused on estabhshing fower bounds on the several p- 
functions introduced in Section 1. Our work so far has given the lower-bound "half" 
of Theorems 11.211.51 we now give upper bounds on the p-functions and complete 
the proof of Theorems 11.211.51 

We remark, parenthetically, that the "half" already proven is the more surpris- 
ing/interesting part of the result, in view of applications. However, the remaining 
part settles any question about whether the lower bounds have slack, i.e. whether 
they actually agree with the precise phase transitions. 

For establishing lower bounds on the p's, we have been applying upper bounds 
on the combinatorial factor, and on the internal and external angles. Now that we 
want upper bounds on the p's, we will turn to lower bounds on the combinatorial 
factor and the angles. 

The required lower bounds will be developed in later subsections of this section, 
effectively we will be using standard ideas such as Stirling's inequality, Laplace's 
method and the Saddlcpoint method. 

Before turning to those lower bounds, we give the arguments completing the 
proofs of Theorems 1 1.211.51 

6.1. Upper Bounds on Strong Phase Transition. The key to tying down the 
strong phase transitions p*g is to use the fact that •) makes a sign change 

at p*g. Indeed, by definition, ^*-^^^{6,p) has a zero bX p — p*{S); but actually it is 
strictly increasing in the vicinity of this zero. For sufhciently small e > 0, we can 
find p — pe > p*s {5) so that 

vl'*,,(<5,p)>2e. 
Set now fc„ = [pefij ; for all sufficiently large n, 
, X * /n + 2 fc + 1\ 

(6-1) ^-(^'^)^^- 

We now invoke lemmas placing lower bounds on the combinatorial, internal and 
external angle factors. 

Lemma 6.1. There is an absolute constant ci > so that 
> ci • —rj^ ■ exp N^*, 



]Sf3/2 - corn y ^ ' £ + 1 

The next lemma is more than we really need at this stage; the extra generality 
will be useful in discussion of the weak phase transition in the next subsection. 

Lemma 6.2. In the proportional growth setting, we have constants C2, C3, and C4 
depending at most on 5, so that, for ^ = n + l,?T. + 3,..., i < n + ^/N, fc = fc„ = 
\_Pen\ , and n > uq: 



int 



(6.2) P{T\T')>C2-e^^[-N^l 

(6.3) a{T\ r^^i) > C3 • exp ^-iV* 

(6.4) a(F^ C^) > C4 • exp ( -TV^' 



l+l k + l 



iV ' £+ 1 
fc + 1 



ext 



N ' l+l 
-fl k + l 



N ' t+1 
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Dn+l,n > C5£l/2iV-3/2 exp ( 7V*^,, 



Combining the last two lemmas, we get - specializing to the case i ^ n + 1- 

'n + 2 k + V 

> C5f/^N-^/^ exp(iVe) ^ oo, 7V„ ^ oo. 

As fk{Q) — Efk{AQ) > -D*+i „, we conclude that A{kn,n, Nn) ^ oo as n ^ oo; 
this completes the upper bound for the strong phase transition p*g. □ 

6.2. Upper Bounds on the Weak Phase Transition. We now aim to show 
that, in the proportional growth setting with n/Nn 5 > Q, and k — 

(6.5) Uk{Q) - Efk{AQ))/fk{Q) > e > 0, n> 

In words, 'above Pw{5) a nonvanishing fraction of faces get lost under projection'. 

In fact we will show that for all large enough n, and for all i in the range 
n+l,n-|-3,...,£<n + \/N, 

(6.6) Dl^/fk{Q)>cn-^'^, £==n+l,n + 3,...; £<n + ^/iV. 
Since this inequality holds for at least \^^/~N terms from the sum we have 

A(fc„,n, Ar„) = Y.'Dl^ > efkiQ), 

for e — c/2, which implies (|6.5I) . 

The different structure of our argument in the weak transition case can be traced 
to the fact that "if net ~ ^/ace '^^^^ i^ot change sign at p = p^y. Instead, it achieves 
its global maximum 0. This means that 

-^iKet-'f}ace)i'^,Sp/^)^0 

from which it follows that, for v £ [S,5 + 1/^/N], and some c > 0, 

{net-'^}ace){'^,Sp/,^)>c/N. 

The combinatorial identity 

n \ /' n — k + l\ fn\f £ 



implies 



k + lj\ l-k ) \lj\k + l 
DtJh{C^)^2-{^-^^^\-P{T\T')a{T\C''). 



and 



We need the following combinatorial result; it follows from Stirling's inequalities 
fLemma l5.7|) and we omit the proof. 

Lemma 6.3. 

N -k-l\ I, , f , / i-k 

> -{N -k - 1)^1/2 exp {N -k-l)H 



£-k J - 3^ ' ^ V ' \N -k-\^ 

We combine this with Lemma 16.21 and get that, under proportional growth 

^+1 fc+ 1' 



DIJHQ) > cN-''^ ■ exp (^NiKet ' 
This implies (|6.6|l and 1)6. 5|l follows. 



N ' £+1 
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6.3. Analysis of the External Angle. Simplex case. We recall the exact formula 



(6.7) aiT'X"-')^ j eM-NMx))dx, 

where ij^uix) = vx^ — (1 — v)\o'gQ[x) and v — Note that throughout this 

Section, 1^31 we use the convention v = consistent with Lemnia l6.2l In the next 
subsection, we use Laplace's method to obtain lower bounds on general integrals of 
this type. That lemma requires estimates which are, in turn supplied by Lemma 
16. 41 below. Here the exponent 'i/'jy should not be viewed as constant in TV; it depends 
on the variable v = -^iJ- which varies slightly; also £ is a variable which ranges in 
the vicinity of n. Lemma 16.41 gives lower bounds on Laplace integrals with uniform 
multiplicative remainders; this yields that for a fixed subinterval < fo < i^i < 1, 
and for each e > there is Afo(e) so that for all v = [Ji + 1)/^ in ('^Oi v\\ 



/ eM~NM^))dx > /— — — exp(-iV*,(a;,)) • (1 - e). 

We conclude from (|6.7|l and H6.8() that for each e > and all sufficiently large 



a(T^T~-i) > + 2a;2) • exp(-ArvI/+ ,(^.))(1 - e), 

where the threshold A^o niay be taken locally uniform in i/ G [0, 1). Here, again, 
is the minimizer of 'ip^{x). It follows that there is a constant c > so that for all 
sufficiently large iV, and all v & In , In = [5,5 + \/\/N], 

a{T\T''~')>c-eM-N-^t.M)- 

Equation (|6.4|l follows. 

Lemma 6.4. Let tpu{x) = vx^ — (1 — ^)^ogQ{x). Then ipv{x) is C^(0, oo), 

(6.8) ^;;(^^) = J!^(i + 2a:2). 

1 — 1/ 



C'(x.) = -(i-^.) 

For e > 0, set 



4;/ 3 2Av 2 16i/ 3 



\x—x,j\<t VvK-^v) 

Then for small e > 0, C{v, e) < 00, and as v 0, C{v, e) ~ 2xv 
Cross-polytope case. We recall the exact formula 



a(F^C^) = \ I exp{~NMx))dx 



n 







where V'i^(x) = vx — (1 - iy)logG{x) and 1^ = {£ + 1)/N We apply Lemma lOI 
bounding Laplace integrals with multiplicative remainder to conclude 



1 f / 27r 

eM-NMx))dx > J ^ exp(-A^^,(a:,)) • (1 + o(l)). 
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Here the o(l)-term is locally uniform over v E [0, 1). We conclude that for each 
subinterval {lyg, t^i) with < I'd < t^i < 1 and for e > we have for N > No{h', e), 

a{F\ C^) > (1 + -^^l)-"^ ■ eM-N^t^m^ " 

Here, again, cc^ is the minimizer of ^^(x). It follows that there is a constant c > 
so that for all sufhciently large N and all v E , In ^ [6, 6 + 1/^/N], 

a(F^C^)>c-exp(-iV*±,M). 

Equation (|6.3|l follows. 

Lemma 6.5. Let ^p^{x) — ux^ — (1 — J^)logG'(x). Then ^pu{x) is C"*(0, oo), 

,p'l(x,) = 2i^-{l + xl-^). 

\ — V 



-^(2 - 4a;2) + %x^z^ + 2x^t^ 
\ — V 



where = y^^- For e > 0, set 

Then for small e > 0, C{v, e) < oo, and as v 0, C{v, e) ~ 4x^. 

6.4. Uniform Laplace's Method. We use a uniform variant of Laplace's method, 
suitable for bounding a collection of integrals uniformly. The approach is similar 

to [ini. 

Lemma 6.6. Let I = [— e, e] and suppose that f attains its minimum on I at 0. 
Let 

^"T no) ■ 

Then 

expi-Nf{x))dx > Jj^^j^yeM-mO)) ■ R{e,N). 

Rie,N) (1 - y|exp(~7Ve2/"(0))) .exp(-7V/"(0)CeVl6). 

The derivation of the lemma is similar to that of Lemma A. 4 in 10 (although 
with all inequalities reversed). 

Lemma 6.7. Consider the collection of integrals 

/"OO 

J{N,X) = / exp{~Nfx{x))dx 
Jo 

and suppose either that A = Aq independent of N or that X — Xq as N ^ oo. 

Suppose that fx has a unique minimizer x\ interior to (0,oo) and suppose that fx 
isC^{0,oo). Let 

C(A,e)= sup ^M. 

Suppose that 

^/AN(a;A„) -> oo, 
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C(Ajv, Eat) 



Then 



J{N, Xn) > J ]vyl^ • cxp{-Nh{xx)){l + o(l)) 



To prove Lemma 16.71 simply translate coordinates so that xx = 0, pick eat = 
N~^^^ and set / = [— e, e], then apply Lemma[ 



6.5. Analysis of the Internal Angle. Our earlier analysis of the internal angle 
employed an upper bound derived in jlU| from large-deviations theory. We now 
develop a lower bound using complex analysis techniques; our analysis is related to 
the approach of Vershik and Sporyshev . 

Let X ^ HN{0, 1) be a real half-normal random variable, i.e. X — \Z\ where Z is 
standard normal. The moment generating function M{t) — Ee*"-^ can be continued 
to the complex plane. We have the explicit formula M{t) e* • 2$(t), where 
$ denotes the standard N{Q, 1) cumulative distribution function. Operations with 
Taylor series show that for lj real, $(ict;) has real part 1/2 along the imaginary axis 
and so the cumulant generating function log(2<i>(z)) can be consistently defined in 
a neighborhood of both the real and imaginary axes. Define 

V'^(z) = zV2+(l-7)log(2$(z)). 

We begin by justifying our interest in the complex domain: 

Lemma 6.8. For 7 = j^, 

1 rioo 

(3iT\ T') = VTTS • 2-^-^-+i • / e(^+2)v..(.)^^_ 

v27r J -ioo 



Contour integration was previously used in the analysis of the internal angle by 
Vershik and Sporyshev, without making the connection to the cumulant generating 
function. The contour integral and the form of the integrand suggests to use the 
method of steepest descents An analysis of ip-yi^) is easily performed computa- 
tionally. One learns that there is a path C-y along which ip-y^z) is purely real and 
which is asymptotic, for large \z\, to the imaginary axis; see Figure |0] This path 
crosses the real axis at a point z-y. Because tpf is real for real z, z-y is necessarily a 
saddlepoint of ip-y . Within the region bounded by the imaginary axis and C-y , ip-y 
is analytic, and so we have the identity 

/too /• 
-ioo <y C-y 

provided the orientation of the path C-y is chosen properly. Parametrizing by ar- 
clength, the contour integral can be rewritten purely in terms of real variables: 

where ip{t) = ip.y{z{t)); this of course is in the form of a Laplace integral. Taking 
into account that 

vi^(o) = v7(^7): ^';{o) = ^';iz^), 

and that tpj{t) is C"'(— 00,00), we immediately have: 
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Figure 6.1. Level curves for the exponent tp^yiz) with 7 = 3/8; 
real (a) and imaginary (b) components. The path along which 
ip^iz) is purely real is overlaid as the wider black line. Panel (b) 
additionally overlays the level curves with the imaginary part of 
ij-y{z) being equal to — tt (green) and tt (blue). The path C-y lies 
between these hyperbolae and the imaginary axis, allowing the 
deformation in Ht).9|) without necessitating branch cuts. The sad- 
dlepoint, Z3/g w — 0.907 + «0, is indicated by the red circle in Panel 
(b). 



Lemma 6.9. Let 7 be fixed in (0, 1). V'7(^) has a saddlepoint z-y on the negative 
real axis and 



27r 



(£ + 2)<(z^) 



exp{(^ + 2)V-y(27)}- (1 + 0(1)), ^^oo. 



Actually, however, we are interested in the case where 7 is changing slightly 
with n, i.e. 7 = 7„ = ^"^^ , and need a stronger result. We note that the third 

and fourth derivatives of ip-yit) near t = are bounded locally uniformly in 7. We 
conclude: 

Lemma 6.10. Fix c > 0. Let 7„ — ^"^^ . In the proportional growth setting, we 
have 



27r 



■ exp{ (^„ +2) V'^„ (z^„ ) } • ( 1 +o( 1 ) ) , n 



_ (£ + 2)^;'Jz^J 
with the term o(l) uniform in n < £„ < n + Cy^. 

To complete the evaluation of the asymptotics of the internal angle we need 
Lemma 6.11. 

^;'(z,) = i-.^^ 

Let ^-^(j/) denote the function introduced earlier in connection with the internal 
angle. Then 

i'jizj) = -(1 - 7) ■ ^7(2/7)- 
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We conclude that 

+ 1 k + 1 



(6.10) (3{T\T')>C2-exp^~N^tnty ^ 
The result follows. 

6.6. Proof of Lemmas 16.81 and 16. Ill 

6.6.1. Proof of Lemma \6.8l Borozcky and Henk gave the formula 

(6.11) (3{T'',T'^) = 6»("-i)/2 . ^(^jn^i)a + \ ■ ir^''"^^ ■ a^^/^ . g,)^ 
where 

(6.12) e = k + l, a = l/(fc + 2), m = £-fc + l, 
and 

poo ^ / poo \ m 

J{m,0) = —J (^'^yj exp{-ev^ + 2iv\)dvj dX. 

Note that 

/ exp{-ev^ + 2ivX)dv = ^ • Ee'^^^, 

where X is standard half normal, X = \Z\, Z ~ N{0, 1). Using now the cumulant 
generating function of the half-normal, 

A(z) = logSe^^, 

we write 

^m/2-1/2 i-oo 

j^rn, 0) = ^ • / e(*^)%"^(^^^)dA. 

Now change variables uj — \J^X^ and write 

^m/2-1/2 /TT /-oo 

Recalling H6.11|) - (|6.12() and noting that 

e^k + l, ((TO + l)a + l)/a = ^ + 3, 

we have 

1 poo 

P{T\ T') = V7T3 • 2-" • • / e-('=+i)"'/2+(^-'=+i)^(*")dw. 

v27r j-oo 



The cumulant generating function of the half-normal obeys A(s) = e'' • 2<i>(s). 
Setting 7 = |ii the exponent can be rewritten as (£ + 2)^^{z). □ 
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6.6.2. Proof of Lem,m,a r6.11\ Note that 

A'(z) = 0(z)/<i>(z), 

where "0 is the standard normal density and <f> is the standard normal cumulative 
distribution function. Hence from ip'^iz) = z + (1 — "f)(f>(z)/^{z) and ^p-y(zj) = 
we have 

(6.13) -zV(l-7) = '^(%)/*(^7)- 



We also have 



and (p' = (—2)0. Hence 



a2 



<W-l + (l-7)[^-J,] 



1-7 

This proves half the lemma. 

For the other half of the lemma, we need to establish a connection between the 
values of i^y{zj) = z^/2 + (1 — ■'y)A{zj) and ^-yiy-y), where 

7 

Here A* (a;) — maxgsx — A(s) is the the classical Fenchel-Legendre transform of 
cumulant generating function on the real axis. It is worth reviewing Sections 6.4 
and 6.5 of jJUj. The definition of A* sets up a one-one relationship between variables 
(y, s), where y = y{s) and s = s{y), where 

A*{y) = s{y)y~-A{s). 

Hence 

1 — 7 

^7(2/7) = 5(2/7)^7 - ^(^7) 2^7/2- 

7 



Formula (6.12) in ^HI reads 



1-7 . 

2/7 — ''71 



this implies 



^7(2/7) = -Y^4/2-A(s^) 
i — 7 



^_^-4/2-4/2"log(2<i>(s7)) 

= --^4/2-log(2<i>(s^)). 

1 — 7 ' 

We note - parenthetically - that the variable s is in this subsection the argument 
to a cumulant generating function, and elsewhere in the paper, the same symbol 
denotes the negative of this same quantity. Moreover the dual relationship between 
s, y variables is expressed through A'{s^) = y^. We compute that ip-y{sj + iO) = 0, 
i.e. z^ = s-y + iO. In words, the saddlepoint value Zj is identical to the dual variable 
s-y. Finally we have 

-^,„ti>^, 7) = -(^7(2/7) + l0ge(2)) • • (1 - 7) = (^7(^7) + l0ge(2)(l - 7)) • 

Compare also section 6.5 of ^Ol- D 
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7. Discussion 

In this section, we first show how the applications (1.1)-(1.4) foUow from Theo- 
rems ll.2l - ll.6l We next consider the performance of these rules at finite n. Finally 
we discuss extensions, open questions, and relations to other work. 

7.1. Convex Hulls of Gaussian Point Clouds. Proof of (1.1). In the 1950's, 
David Gale |16| introduced an important extremal property of polytopes; the fol- 
lowing is now classical: 

Definition 7.1. |18[ Chapter 7] A convex polytope P is called fc-neighborly if 

every subset of A: + 1 vertices spans a fc-face of P. 

By mere face counting, we can determine whether a polytope is /c-neighborly. In 
this section, put for short T = T^^^ . 

Lemma 7.2. QSJ Chapter 7] Let P = AT. Suppose that 

• P has N vertices. 

• P has (j,^]^) k-faces. 

Then P is k-neighborly. 

Combining Theorems 11.31 and II . 61 we have: 

Corollary 7.1. Let {kn,n, Nn) be a sequence of triples with n tending to oo, and 
Nn growing subexponentially with n. Fix e > and suppose that 

kn<{l~e)-- — - — — — „ , n>no. 

2e • log(iV„/(n • 2y^TT)) 

Let A be a random n x N matrix with iid iV(0, 1/n) entries. Define the event 
n{k, n, N) = {P = AT is fc-neighborly}. 

Then 

P(ri(fc„, n, iV„)) — > 1, as 71 — > cxD. 

In words, with overwhelming probability for large n, P — AT is at least A:„- 
neighborly. 

This is simply (1.1) in another language. To see why, note that, for each fc > 1, 
a A:-neighborly polytope is also k — 1-neighborly. If ai, . . . , on are vertices of 
A = conv(ai, . . . a^r), then fc-neighborliness of A is equivalent to the following k 
simultaneous properties: 

• every pair [at , Oj ) spans an edge of A 

• ... 

• every k + 1-tuple (a^^ , . . . , a^j.) spans a fc-face of A. 

This is precisely the condition mentioned in Section 1.1.1 with the substitutions: 
Xi Ui, A X , n d, and N n. 

To conclude, we note that P = AT has N vertices with probability 1, those 
vertices are simply the columns of A, and so P = conv(ai, . . . , ajv). Invoking now 
the above corollary we obtain the conclusion (1.1). □ 
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7.2. Correcting all patterns of k or fewer errors. Proof of (1.4). A con- 
vex polytope is centrosymmctric if it lias 2N vertices made of N antipodal pairs. 
Neighborliness per se does not apply to centrosymmetric polytopes, instead one 
needs the following notion: see eg .18, Chapter 8]. 

Definition 7.3. A centrosymmetric convex polytope P with vertices ±ai, . . . , ±aN 
is called centrally fc-neighborly if every subset of fc + 1 vertices not including an 
antipodal pair spans a fc-face of P. 

By face counting, we can determine whether a polytope is centrally ^-neighborly. 
In this section, put for short C — . 

Lemma 7.4. (101 Lemma 1] Let P — AC. Suppose that 

• P has 2N vertices; and 

• P has 2'=+! • (^^J k-faces. 
Then P is centrally k -neighborly. 

Combining Theorems 11.51 and 1 1 . 61 we have 

Corollary 7.2. Let {kn,n,Nn) be a sequence of triples with n tending to oo, and 
Nn growing subexponentially with n. Fix e > and suppose that 

fc„<(l-e)-- — - — — 71 > no. 

2e • log(A^„/(n • y/Tr)) 

Let j4 be a random n x N matrix with iid iV(0, 1/n) entries. Then P = AC is a 
random centrosymmetric polytope. Define the event 

ft{k,n,N) = {P = AC is centrally fc— neighborly}. 

Then 

P{fl{kn,n, Nn)) 1, as n — > cx). 

In words, with overwhelming probability for large n, P — AC is at least fc„- 
centrally neighborly. 

We now relate central fc-neighborliness to (1.4). Recall the optimization problem 

(Pi) min||x||i subject to y = Ax. 

X 

Call the solution xi] it obviously depends on y and A. 

Theorem 7.5. [TTj The following statements about an n x N matrix A are equiv- 
alent. 

• The polytope AC has 2N vertices and is centrally k-neighborly. 

• For every problem instance y = Axq where xq G has at most k nonzeros, 
the solution xi to the corresponding instance of (Pi) is unique and is equal 
to Xq. 

To apply this, recall the setting of Section Fl.l.^^l The encoding matrix B men- 
tioned there was obtained as follows: a random orthogonal matrix U is generated, 
and B makes up A'^ — n rows of this matrix. The checksum matrix A makes up the 
other n rows of U. 

Given received data w e R^, form the generalized checksum y = Aw € R". 
Then solve the instance of (Pi) defined by {y,A). Define the reconstruction ui = 
B{w — xi). (1.4) now follows from the above, and the following: 
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Claim. If AC is centrally A:-neighborly, and if the error vector z has at most k 
nonzeros, one has perfect error-correction: 

u — Ul. 

Proof. The received message w = B^u + z where the error vector z has, by 
hypothesis, nonzeros in at most k positions. Since AB^ = 0, y = Az. Invoking 
Theorem l7.5l we have xi = z. Hence B{w—xi) = B{w—z) = BB'^u+z — z = u. □ 

7.3. How Many Projections? Proof of (1.2). We first transform the "how 
many questions" problem into face counting. 

Definition 7.6. The random n x N matrix A will be called orthant- symmetric if, 
for every signed permutation 11, and for every measurable fl C R"^^, 

P{A en} = P{AI1 e Q}. 

Theorem 7.7. Let A be an orthant symmetric random n x N matrix. Let xq 
be a fixed vector with k nonzeros. Form a random problem instance {y, A) of {Pi), 
where y ~ Axq. Let x\ denote the solution of this instance of {Pi). 

Eh-i{AC) 
P{xi = xo) > — -— . 

Jk-l{C) 

Theorems II .41 and II .61 implv the following precise version of (1.2). 

Corollary 7.3. Let {kn,n, Nn) be a sequence of triples with n tending to oo, and 
Nn growing subexponentially with n. Fix e > and suppose that 

TL 

(7-1) kn<{l-e)-— — , , n>no. 

2 • log(7V„/n) 

Then 

Efk-i{AC) 

— — .1, n^oo. 

Jk-l{C) 

In words, for {k,n,N) obeying the asymptotics H7.1|l . an overwhelming fraction 
of the k — 1 faces F of C induce k — 1 faces AF of AC. 
P — AT is at least fc„-neighborly. 

7.4. Correcting random patterns of k errors or fewer. Proof of (1.3). Let 

llzjlo count the number of nonzeros in z. 

Definition 7.8. The random vector z is a symmetric k-sparse random vector if 

• P{z e fi} = P{—z G r^} for all measurable sets fi; and 

• P{\\z\\o < k} = 1. 

Suppose that the received message w = B^u + z where u is arbitrary and z 
is a symmetric fc-sparse random vector stochastically independent of A,B. Define 
y = AfjL and consider the resulting instance of (Pi ) . Then, conditional on each fixed 
realization of z, put xq := z and apply Theorem 17. 71 to get that 

E{fk^i{AC)\z}>{l-e)fk-i{C). 

implies 

P{xi = z\z} > 1 - e. 

By independence of z and A, 

E{fk-i{AC)\z} ^ Efk-i{AC). 
Apply now Corollarv l7.3l to infer (1.3) □ 
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7.5. Empirical Results. The phenomena uncovered by Theorems 11.21 and 1 1 . 41 can 

be observed empiricaUy. For a given {S,p) pair, pick a large N, generate a random 
A of dimensions n = \_SN\ by N, and check whether for k = [p • nj , a randomly 
chosen fc-face F of Q, yields a projected simplex AF that is also a face of AQ; here 
Q — j^N-i Q _ (jN _ This can be verified by linear programming. 

Let F be a fc- 1-face of Q = T^'^ or C^. Then the elements of F have nonzeros 
in only k coordinates. HQ — T^^^ the nonzeros are nonnegative; if Q = the 
nonzeros have a definite sign pattern particular to the interior of F. In the following 
result, let xf denote the barycenter of the face F, and let {V) denote problem (Pi) 
if Q = or problem [LP) if Q = T^-i, where 

{LP) min V x subject to y = Ax, x>Q. 

Theorem 7.9. I13| Let Q — T^^^ or . The following statements about a 
face F of Q are equivalent. 

• AF is a face of AQ . 

• Let yp — Axf ■ Then xf is the unique solution of the instance of {V) 
defined by {yF,A). 

Thus, to check Theorems 11.21 and 11.41 one checks that for a randomly-generated 
vector xo with k nonzeros, the corresponding vector y = Axo generates an instance 
of either {LP) or (Pi) uniquely solved by xq; {LP) corresponds to xo > and 
Q = T^~^ whereas (Pi) corresponds to xq with entries of either sign and Q ~ C^. 
If such uniqueness holds, we call that experiment a success. Theorems 11.21 and II .41 
imply that for k below a given threshold, success is very likely while above that 
threshold, success is very unlikely. 

We conducted 44,000 such experiments with the common value N — 10,000, 
exploring the {S, p) domain as follows. We considered n = 10, 15, 20, . . . , 100; for 
each value of n, eleven values of the sparsity, fc, were chosen near the asymptotic 
thresholds, n ■ |2 log(5)|~"'^. At each combination of k and n, two hundred random 
problem instances were generated. 

Figures !?. II (a)- (c) summarize our results. A region of the {6, p) plane is decorated 
with a shaded attribute depicting the fraction of successful experiments. Figure 
IT.lf a') shows the simplex case, along with the threshold p^r{S) and its asymptotic 
approximant, |2 log((5)|~^. Figure mT b) shows the cross-polytope case, with the 
threshold p^{S) and the approximant, |2 log((5)|~-^. To better highlight the (subtle) 
difference between the simplex and cross-polytope cases. Figure IT.lf c) shows the 
fraction of cases where the simplex experiments were successful and the cross- 
polytope experiments were not. 

Figure mi fa'l and (b) display a remarkable match between the thresholds p^{S), 
p^{S) and their asymptotic approximations, |21og((S)|~^. Both curves track the 
observed empirical phase transition. This empirical transition is of course not a 
true discontinuity, because we are working with finite problem size N — 10, 000; 
instead it is a relatively abrupt change. Still, some relatively sharp distinctions 
can be made; there is a definite region where the simplex experiment is typically 
successful but the cross-polytope experiment is not - see Figure mi fc). 

For S near 1/100 the empirical transitions at TV = 10, 000 show a clear agreement 
with the theoretical thresholds pw{S) and the |21og(5)|~^ asymptotic approximant. 
Fixing the region 6 £ [1/1000, 1/100] explored in Figure mi and increasing TV offers 
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better resolution in fc/n; the sharper empirical transition is again in agreement with 
the theoretical thresholds. 

7.6. Exponentiality. A key element of our proofs, which we have not emphasized 
in the formal statement of our theorems, is that the rate of approach to all the 
limits of interest is exponential in the proportional growth case. Thus, we have 
shown that, for p < p%{S), there are positive numbers <5, p) yielding 

fk{Q) - EfkiAQ)} < N^^ exp(-V2iV). 

In all our proofs can be found explicit calculations of these exponents and remain- 
ders. Similarly, we have shown that, for p < there are positive numbers 
'4>i{*,S,p) yielding 

Jk{Q) 

In the non-proportional growth case, analogous relations hold, provided we consider 
triples {kn,n, Nn) along a trajectory fc„ — [r^(ri/A^„; r) • n\, with r > 2e or A;„ = 
[r^(n/iV„;r) • nj , with r > 2, Theorem [131 

In fact our results are strong enough to yield explicit values effective at moderate 
{k, n, N). The following two Theorems follow directly from equations (|5.9|l . H5.14|l . 
H5.19|l . H5.20|l . H5.25|l and (|5.2t)|) . The notations i>, i>, etc. are defined in those 
equations. 

Theorem 7.10. 

/fe(T^-i) - EMAT^-') < {N + 3fexp {N - *+ ^(^,7) - *+,(^,7)]) 

and 

/fc(C^) - EMAC'') <iN + 3)5 exp {N [*±„(j.,7) - ^f.ti^) - ^Lii>,l)]) , 
each uniformly over k — 1, 2, ... n and n = 1, 2, . . . iV — 1. 
Theorem 7.11. 

and 

EMAC^) 
hi 

each uniformly over k — 1, 2, ... n and n = 1, 2, . . . iV — 1. 



> l_(iV+3)"/2exp (n [{^to^ - - ^ttiP) - *L(^,7)]) 



Equipped with these explicit bounds, we can make nonasymptotic bounds an- 
swering a variety of interesting questions: 

Ql f Setting II. 1.1|1 : At a particular choice of N and n, for what values of 
k is there a positive chance that a standard Gaussian point cloud has a k- 
neighborly convex hull? Similarly, consider the symmetrized Gaussian point 
cloud with TV points {ai, — ai, 02, —02, . . . , ajv/2, — 0^/2} (and the a^iid stan- 
dard normal). For what values of k is the resulting convex hull fc-centrally 
neighborly? 
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Figure 7.1. Panel (a) Success fraction, simplex; Panel (b) Success 
fraction, cross-polytope; Panel (c) Fraction successful for Simplex 
but not for Cross-Polytope. N = 10, 000. 
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Q2 (Setting 1.1.2): At particular values of N and k, what values of n are 
associated with at least a 99% success rate in recovering a fc-sparse object 
from n random questions? Similarly, what values of n are associated with 
at least a 99% success rate in recovering a /c-sparse object from n random 
questions supposing we know that the fc-sparse object is nonnegative? 
These questions can be answered by establishing the bounds 
fk{T^-')-Efk{AT''-')<l, or /..(C^) - < 1, for (Ql) 

and 

for the given (fc, n, N) of interest. Simply plugging in the expressions in Theorems 
17.101 and 17. Ill we immediately get bounds of the required form. 

Since we have developed a series of computational tools to evaluate the '^n^t and 
related quantities, it is rather easy for us to numerically compute nonasymptotic 
bounds answering Q1-Q2. 

Figures [7!^ a-b') are relevant to Ql. They show the unit level set Bound{k, n, N) = 
1 for the bounds in Theorem ITTTTl for N = 200, 1000, and 5000. 

Corollary 7.4. For a given N e {200, 1000, 5000}, consider values of k and n such 
that {n/N, k/n) lies strictly beneath the curve corresponding to that N depicted in 
Figures FT^ aV There exist nxN matrices A such that AT^~^ is fc-neighborly. They 
can be obtained with positive probability by random sampling from the standard 
Gaussian distribution. 

Consider values of k and n such that {n/N, k/n) lies strictly beneath the curve 
corresponding to that N depicted in Figures FT^ b^. There exist n x N matrices A 
so that AC^ is centrally fc-neighborly. Such matrices can be obtained with positive 
probability by random sampling from the standard Gaussian distribution on R"^^. 

□ 

Figures \7.'2{ c-d) are relevant to Q2. They show the domain in the phase dia- 
gram in which, on average, at least 99% of faces survive the prescribed dimension 
reduction. 

Corollary 7.5. For a given A'^ S {200, 1000, 5000}, consider values of k and n such 
that {n/N, k/n) lies strictly beneath the curve for that N depicted in Figures FTW cV 
Fix a given face F of T^~^ independently of A. There is at least a 99% chance 
that AF is a face of AT^~^ . Again, we refer to A generated by random sampling 
from the standard Gaussian distribution. 

Consider values of k and n such that {n/N, k/n) lies strictly beneath the curve 
for that A^ depicted in Figures [^Jd) . Fix a given face F of independently of 
A. There is at least a 99% chance that AF is a face of AC^ . Here probability 
refers to random sampling from the standard Gaussian distribution on R"^^. 

Due to the exponentiality of the bounds in Thcorem l7.11l there are no perceptible 
changes in Figure 17?^ when the specified levels used in calculating those figures are 
changed, ie. if we changed to 50% success from 99% success rate in panels (c-d), 
the figures would not change substantially. 

It should also be noted from Figure \T7^ that even for small A^, say 200, when 
5 = n/N is relatively large there is already a large region below the level curves. 
However, for A^ and n/N simultaneously small, our bounds become weak or useless. 
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For instance, the N — 200 contour in Figure FT^ a) reaches zero at about n/N = 
1/20, corresponding to n = 10. 

In fact, the bounds we presented in Theorems 17. lUI and 17. 1 II do not indicate the 
full power of our approach. Those bounds, in fact, are presented here because they 
follow immediately from what has been done above, and they seem easy for readers 
to digest. For projections to very low dimensional spaces, the polynomial factors of 
the bounds provided in Theorems 17 . 1 01 and 17 . 1 11 become important; to go beyond 
the work reported here, care must be taken in combining equations (|5.9|) . I|5.14|l . 
()5.19|l . H5.20|l . H5.25|l . and (|5.26|l to arrive at (|2.2|l : and also perhaps in sharpening 
the underlying remainder estimates. 




Figure 7.2. Panels (a) and (b): Unit level curves 
(\Bound{k,n, N)\ = 1) for the upper bounds in Theorem 
ITTTTI for N=200 (blue), 1000 (green), and 5000 (red); Q = T^-^ 
(left) and Q = (right). The asymptotic, iV ^ oo, hmits pg 
are shown in black. Panels (c) and (d): The 1/100 level curves 
for the lower bounds {\Bound{k,n, N)\ = 1/100) in Theorem 17. Ill 
again for N=200 (blue), 1000 (green), and 5000 (red); Q = T^-^ 
(left) and Q — (right). The asymptotic, ^ oo, hmits 
are shown in black. 
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7.7. Relation to Other Work. We discussed face-counting related work in the 
body of the text as the opportunity arose. We now mention several categories of 
related literature. 

7.7.1. How Neighborly can a Polytope Be? Theorems 11.31 [131 and 11.61 imply the 
following. For N ^ n, both large, and N subexponential in n, there exist polytopes 
P which are: 

• k -neighborly with 

n 



2elog(Af/(n- v^))' 

indeed, simply take P ~ AT^^^ where A has Gaussian i.i.d. entries, 
centrally k-neighhorly with 



n 



2elog(iV/(n-2V^))' 
indeed, simply take P = AC^ where A has Gaussian i.i.d. entries. 
Recently, the problem of showing the existence of high-dimensional neighborly 
polytopes has attracted a resurgence of interest. After fundamental work in the 
1950's-1970's starting with D. Gale [T71 and extending through P. McMuUcn 
and G.C. Shephard [^Hl and R. Schneider j^D], the subject was very quiet. Now, 
as Schneider wrote one of us, "the subject has come to life again" . Our own work 
[T^ [TSl [TUl [TT] carefully studied the questions of neighborliness and central neigh- 
borliness of projections of random polytopes in the proportional growth setting. 
Our attempt was to characterize the exact location of the asymptotic phase tran- 
sitions associated with strong and weak neighborliness. Linial and Novik [53] gave 
exponential bounds on the probability that AC^ is centrally neighborly; note that 
Rudelson and Vershynin's work [28] came earlier and implies similar bounds by 
duality. Both [^ [5S] use a geometric functional analysis approach which gave 
inequalities akin to, 

P{fk{AC) ^ fkiC)} < V'l exp(-^2n), n > no, 

valid for k < cn/\og{N/n) with unspecified constants. As our paper was nearing 
completion, we learned that Rudelson and Vershynin [2H| had been able to supply 
specific constants. Their result is as follows. 

Theorem 7.12. [Rudelson and Vershynin [Theorem 4-.1]] Fix {k,n,N). Let A 
be an nx N random matrix from the standard Gaussian distribution. Let xq be the 
vector with k nonzeros and let y — Axq. Let D, be the event {The instance of (Pi) 
defined by {y,A) has xq for its unique solution}. Then 

2 



(7.2) P{n) > 1 - 3.5 exp i^- ^V" - N)^ /18 

where 

(7.3) m{k,N) < ciklog{c3N/k){l + o{l)) 
with ci := 6 + 4V2 « 11.66 and C3 := e^/^ « 4.48. 

This striking result illustrates the ability to obtain explicit constants using ex- 
isting approaches from geometric functional analysis; the proof is admirably short. 
Moreover, it opens the important question of getting explicit results in the finite- 
sample non-asymptotic case. When we learned of this result, we decided to include 
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here a quantitative comparison, illustrating the relative strengths of our different 
results. Theoreni l7.12l can be recast in a form similar to the cross-polytope portion 
of Theorem Eini 



(7.4) 



Vm(fc,7V) /18 



While this result has the appearance of an explicit, finite- result, note the 
term o(l) in 17.3|l which needs to be converted into an explicit numerical term to 
enable concrete finite- comparisons. It seems that one can bound this by « 1/10 
for N > 1000. Rudelson and Vershynin have informed us of upcoming explicit 
bounds on the o(l) term. It appears that replacing the o(l) term by zero in the 
definition of m(k, N) gives a lower bound on the actual expression developed by 
Rudelson and Vershynin. Placing this lower bound for m in the right side of (|7.4|) 
yields a lower bound on the right hand side of Rudelson and Vershynin's upper 
bound. We now make some comparisons between our actual upper bounds and this 
lower bound on Rudelson's and Vershynin's upper bound. To be fair, Rudelson 
and Vershynin's interest was in simply obtaining reasonable bounds by geometric 
functional analysis, which is rather different from our focus here. 

Figure 171^ illustrates how H7.4(l compares to the finite-sample bounds developed 
by techniques of this paper in Subsection l7.6l The curves hugging the bottom of the 
display are those implied by Rudelson- Vershynin's Theorem l7.12l when we replace 
the o(l) term by 0; the much higher curves are those implied by our Theorem 17. lUI 
The streamlined appearance of the bounds in Theorem 17.121 come at the cost of a 
remarkably small region of effectiveness. For instance, the curves associated with 
P{fk{AC^) = /fc(C^)} > which follow from Theorem rm do not exceed 0.0151 
for any N; whereas, Figure [731 illustrates that for even a modest N = 200, this same 
probability level following from Theorem 17. 101 is exceeded well before n/N = 1/6. 




Figure 7.3. Left: Unit level curves {\Bound{k,n, N)\ = 1) for 
the upper bounds in equation (|7.4() (blue) associated with Theorem 
17.121 and the cross-polytope, , portion of Theorem l7.10l fblack). 
Curves present the cases 7V=200 (dot-dash), 1000 (dot), and 5000 
(dash). The asymptotic, N oo, limit is shown in solid. 
Right: Enhanced bottom left portion of left panel. The upper 4 
curves come from our approach; the lower three curves come from 
the results of Rudelson and Vershynin |29|. 
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7.7.2. How Many Projections are needed to recover a k-sparse object? One reason 
that study of neighborliness "has come to life again" is the surprising imphcations 
for speeding up key processes in medical imaging and proteomics. The general idea, 
often labeled Compressed Sensing 8 is that images, spectra, and other real-world 
objects are highly compressible, and that this compressibility makes it possible to 
reconstruct such objects accurately from relatively few carefully chosen generalized 
samples. In effect Section 1.1.2 has described an abstract model of compressed 
sensing. 

In the application scenario, Xq represents the coefficients of an image to be 
acquired and the rows of A represent a random set of of linear combinations (mea- 
surements) which will be used to reconstruct xq. In effect, we are saying that if xq 
has N pixels but only k <^ N nonzeros in (say) a wavelet basis, and if k and N are 
large, then we only need n measurements, where 

n > 2k\og{N/n){l + o{l)). 

In contrast, N is the 'standard' number of samples; the point is that for objects 
which are fc-sparse with k small, we can easily have n <C iV if xq is highly sparse. (In 
fact real objects will not exhibit such strict sparsity - k zeros and N — k nonzeros 
- but because the (Pi) has an £i stability property [S], we can pretend that this is 
so without distorting the problem.) 

The interested reader may pursue the papers of Candes and collaborators, 
other theoretical work |2HI 1201 1^ and much recent applied work [32 ^5 ■ 

The quantitative approach developed here is precise about how much data would 
be needed. Most of the cited theoretical work is qualitative, often leaving the con- 
stants unspecified. An important point: in Section 1.1.2 and in (1.2) we are studying 
the equivalent of weak central neighborliness. We argued in Section 17.51 that this 
is the empirically relevant notion, we repeat here that ordinary (strong) central 
neighborliness is simply not empirically observable. Nevertheless, most authors 
have effectively studied implications of ordinary (strong) central neighborliness. 
That notion is hard to analyse, and appears to indicate a far more pessimistic view 
of what is possible than what one actually observes in practice. 

7.7.3. Fast Decoding of Error- Correcting Codes. In general, decoding of linear error- 
correcting codes is NP-hard JSl- However, fast decoding of specific error-correcting 
codes has been an object of great practical and theoretical attention over the last 
10 years, with great advances in turbo codes and in LDPC codes (Gallager Codes). 
We proposed in Section 1.1.3 above a simple scheme for fast decoding of random 
linear codes over R using (Pi). The scheme we proposed is equivalent to one pro- 
posed by Candes and Tao and studied further by Rudelson and Vershynin [2H|- 
Using the notation of our Section 1.1.3 their decoder solves the i'l-minimization 
problem 

min \\w — P"^u|| 1, 

u 

yielding the reconstruction ui, say. The equivalence of such minimization with the 
one proposed in Section [1.1.31 is shown in [Section 8]. 

It is of course crucial to know how many errors such a scheme can correct. The 
theoretical literature (Candes- Tao/Rudelson- Vershynin) gives qualitative results, 
saying that one can correct at least cn/\og{N/n) errors, with c left unspecified, or 
else specified as a constant which seems much smaller than what would be expected 
based on a comparison of those papers' results with the results obtained here. 
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The problem solved in those visionary papers is to show existence of integer- 
valued matrix pairs A, B allowing block coding of messages of length m as blocks 
of length TV, such that all patterns of at most k errors can be corrected, here 
n = N — m. Our results here change the problem so that A and B are generated 
by partitioning a uniformly-distributed random projection matrix (n.b. not with 
integer- valued entries); with this change, we get a precise asymptotic formula k = 
np^(n/N){l + Op{l)). For the case N ^ n we have proven the formula pg(n/N) ~ 
l/2elog(iV/(nV^)). 

If we change the problem again slightly so that the goal is to correct nearly all 
rather than all error patterns, then for the case of long block codes, we get a precise 
asymptotic formula k = np^{n/N){l + op{l)). For the case ^ n we have proven 
the formula p^{n/N) ~ 1/2 log(A^/n). 

Conceivably, such results for the "changed problems" we just mentioned may 
be better than for the original problem; i.e. the situation for general random 
matrices may be more optimistic than for matrices with integer entries. However, 
our empirical results with Rademacher random matrices indicate that our formula 
np^{n/ N) accurately describes the integer- valued case as well, i.e. accurately 
describes the number of errors which can typically be corrected by such random 
matrices with integer-valued entries. 



Appendix: Proofs of Key Lemmas 

7.8. Proof of Lemma 14.11 We develop (|4.7|l in two stages. Initially, we derive 
the asymptotic behavior of s.^ as 7 — *■ 0; we then substitute that into equation 
(I2.2(J|I . To motivate our approximation of we use an asymptotic series for R{s) 
appropriate for the regime of s large, 

s'-/2 r ~vV2, , 1 1-3 1 • 3 • 5 1 • 3 • 5 • 7 
R{s) := se' e ^ '^dy + — — + + 

This is derived as follows. The ratio R{s) — s ■ Mills{s) for s > 0, where Mills{s) 
is the usual Mills' ratio for the normal distribution. The corresponding asymptotic 
series for Mills' ratio is developed in [23, Sees 5.37,5.38]; H. Ruben [23 credits this 
series to Laplace. 

In Eq. (5.106)] it is shown that the error in truncating the series for Millsi) 
at the s-th term is at most as large as the s-th term itself. i?() inherits this property. 

It is now convenient to define L(s, 7) :— R{s) — 1 + 7 and note that is defined 
by i(s7,7) = 0. 

Keeping the first two terms in the series expansion for R{s) and applying the 
bounds from |23| Eq. (5.106)] yields L(s,7) = 7 - s^^ + 3^-4 + 0{s-^), which 
suggests the approximation, 

(7.1) « ~s, := 7-^/^ - ll'/'. 

To quantify the error in this approximation, invoke the mean value theorem; 
given a smooth function F{x), there is always a point w G [min(a;, y), max(a;, y)] 
satisfying 

(7.2) F{y)^F{x) + {y-x)^F{y)\y^^. 
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Hence we can bound \y — x\ if we have suitable bounds on \F{y) — ^'(a;)| and 
-^F{y)\y^^. Apply this principle to F{s) — L{s,j) about s^, getting 



(7.3) 



S-y — Sj\ < 



for some point s,nid G [niin(s-y, s.^), niax(s-y, S'y)]. 

The following bounds follow from 23, Eq. (5.106)] 



(7.4) 1 - s- 

yielding in turn 
(7.5) 



< R{s) < 1 - + 3s-4 



for s > %/30, 



1^(^7,7)1 < ^7', for 7 < 1/30. 



To bound the denominator, note that 
d 



ds 



L(s,7)-[.s + .s-i]-i?(s)-s, 



which is a positive decreasing function of s; this attains its lower bound on the 
interval s G [min(s-y, s^), max(s^, s^)] at one of the endpoints {s-y,s^}. At s-y we 
again make use of the lower bound on Mills' ratio in equation (|7.4|l 



d_ 



^(s,7)| 



(7.6) 



3 5 ^_ c: 



3/2 



For the lower bound at we assume \sj — < 57^^^ (which we will verify 
momentarily), which gives the upper bound s-y < 7"^/^ — 7^/^. From this we have 
the lower bound. 



d_ 



[s^ + ^] • (1 - 7) - = (1 - 7)s^ 1 - 7s, 
1 



> (1-7) 



7-1/2 _ ^1/2 



7(7^ 



7 '"7 
1/2 _ ^1/2) ^ ^3/2 



Using these bounds in equation (|7.3|l we have 
(7.7) 



\s^-~s^\<lj^/^, for 7 < 1/30, 



which justifies the earlier claim that js^, — s^j < ^7^^^- For the following calculations 
the following estimate sufhces, 



(7.8) 



-7-1/24.^^(7)^ with |ri(7)| < 27^/^ for 7 < 1/30. 



Combined with H2.20|l . this gives H4.7|l and hence, Lemma [4. II 



□ 
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7.9. Proof of Lemma I4.2L We first motivate our approximation for x^, which 

solves 2xQ{x)/q{x) = 1 — i^^^. The trmicated asymptotic series 

2xQ{x) 



q{x) 



27T^/^xe^ -l + 0{x-^), 



X oo, 



suggests approximating the solution to 



(7.9) 



xe 



iTT-i/^z.-! = 0; 



this is exactly of the form H4.9|l with z — z'^ :— {21/^/^)^^. Our approach for 
approximate solution of (|4.9() . carried out to two stages, yields the approximant a;^, 
obeying: 



(7.10) 



xl := logz+ - i log log 



Our claim that accurately approximates x^, as v ^ 0, as stated in Lemma 14.21 
will be supported by arguments similar to those used in proving Lemma l4. II 
Let 



(7.11) 



J{x, v) := 2xe'' 



e ^ dy + 1 — v 



whose level curve J{x^, ly) = defines x^. To bound the error in the approximation, 
x^, we again use the mean value approach H7.2|l . getting 



(7.12) 



/ d 



for some point Xmid G [min(a;,y, i,y), max(x,y, i,y)]. The magnitude of J{xu,v) can 
be bounded by 

\J{x^,v)\ ^ -J{S:^,iy) ^ -2Tr'^/^S:^e^^^ +re{v) 
< -27ri/2i^e*' + I'-S J/ < 1/10 



(7.13) 



(7.14) 



l_ I l_ 1 loglogz+ 
2 log 2+ 



_i 3 log log 2:+ 
logz+ 



1/2' 



where the transition from the first to second line utilizes rg (i^) := 2x,ye^^ e ^^dy— 
1 < for 1/ < 1/10. 

Turning to the denominator in H7.12|l . we observe that on the half- line a; > the 
derivative is a positive increasing function, 

d 2 
-J{x,y) = 2a; + 2(1 + 2a;^)e^ 



dx 



(7.15) 



1 -I- 2a;^ 



dy 



a lower bound for dJ/dx over [min(a;^, a;^), max(a;^, i^)] is attained at one of the 
endpoints At Xjj Si simple lower bound is 

d 1 + 2x'^ 
(7.16) j^Jix, v)\x=x,. = -v^'^ - > 2x^v-^ for v <l. 

OX X ly 
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A similar lower bound holds at 



d_ 

dx 



1 + 2x 



^■2 



> 



^ 1 log log z+ 



(7.17) > x^iy 

Combining (f7?TB|l and ifTTfjl . 



2 log z+ 
l + (logz+)-i- 
fori/<l/4. 



XijV ^ + Xi,V ^ 



[by irmji 

log log z+ 
logz+ 



(7.18) 



d_ 

dx 



J{x,v)\^=^^^^>v ^min(a;i.,J^); 



although crude, this is sufficient for later purposes. 
Shortly we will prove there is > Q such that 



nwni^x^.Xi,) > -Xi,, 



(7.19) niin(a:;^,5:^) > -Xi,, < < vq. 

Substituting H7.14|l and (|7.18|) into equation (|7.12f) gives 

(7.20) 



, I . 1 _iloglogz+ 

< -x^ — — . 

2 log z+ 



Lemma [4.21 follows by simple substitution of terms. 

We now show (|7.19|l . Recall that e^^ dy = -^/tF. Hence on a; > 0, J{x, v) < 
J(x, v) := 2^xe^ — As J{x, v) is monotone increasing on (0, oo) it follows 
that J{x' ^ v) < Q implies x^ > x' . 

We now show that if < a < 1, then 



(7.21) 



J{ax^, ly) < 



for 1/ sufficiently small. Setting a ~ 3/4, this will imply J{jx^,i^) < for all 
sufficiently small i^, and so, for such i/, min{xij, x^j) > (|7.19|) follows. 
Proceed thus: 

J( ~ \ or- aVlog(^+) - 1/2 log log z+ ^ 2 
■Hax.,iy) = 2^ \og{z+y'/2 +^ 

= 2V^a ■ l0g(j.-l)(l-'^')/2(^ ^ . ^-a- ^ ^ _ 

= o(!y"^) + 1 - i/^O. 

(Trnil follows. 

7.10. Proof of Lemma 14.41 We will show that 



-1 



□ 



(7.22) 



d 



A/[— ,](5,r+(<5)) < - log ( - ) + 0(1), S ^ 



2e 



Because r > 2e, the leading term on the RHS is a negative constant, showing that 
for small enough 6 the function '^^et monotone decreasing in ly on the admissible 
domain, implying the assertions of the Lemma. Now 

— (v 'y) = —(-^^ - - 

netVj // dv ""^ tnt ^ extJ 



logx^ + ^ log(47r) + H{j) - (1 - 7) 



log ( 



V 7 



^log(2.) + ^ 
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Over the interval v G [(5,1), the first component, logXj/, is largest aX v = 6. 
Applying Lemma 14.21 we have 



log < log xs 



(7.23) 



1 



log log z J +log 



1 log log 

2 log z+ 



1/2 



+ r3(J)(logz+)-i/2 



< 2^°SlogZ5^ 



for (5 < 1/50. 



The RHS of H7.23|l is an increasing function of 7, maximized at 7 = rg{6). Using 
Lemma [4. II gives, for all 7 small enough: 



Hil) - (1 - 7) 



log 



+ i log(27r) + 



27 



(7.24) 



< ^ log 7 + ^ log(e/27r) + 67 - ^7 log 7 



= - log 
2 ^ 



2'KT log Zg 

Combining fT^ - fT^ yields (|7:^ 



O 



/ log log 
V log 2;/ 



, (5^0. 



□ 



7.11. Proof of Lemma 14.51 It is sufhcient to show that for some 70 > 
d 



(7.25) 
Now 



97 



KeMl)>^/^, t^e['5,l), 0<7<7o. 



d_ 

97 



- 1 

7-1 - 2 log 7 + log(2/^) + ^i^yl, + log(l - 7) 



(7.26) 



+llog(27r) + l + (l-7)^('^- ^ 



7 y7(i - 7)/ c?7' 

Lower bounds for each but the last term follow either directly or from Lemma l4. II 
For 7 < 1/10, y~^ satisfies 

(7.27) _Z!^!._473/2>y/2(i_3^) 



and 



1-7 

^+473/2 < 71/2(1 + 67), 
1-7 



from which follow both 



and 



72-1 



log 2/7 > 2^°g'^^4T' 



-1 



-1 



2 > + 67)^(1 -7')t^:^(1 + 167), 



2^2 ^7 27 ' " ^ ' ' 27 
respectively. The last term in H7.26|l requires estimating 

7(7 - 1) 



rf7^" (1-7)2 



^si + 7 - 1 



From 



7(7 - 1) > 75^ + 7 - 1 > 47^ - 37 > 47(7 - 1) 
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yielding 



(7.28) 



d 3 



for 7 < 1/30. As the above quantity is positive, a lower bound for the last term in 
II7.26|I is obtained with a lower bound on its multiplicative factor, 

1 



(1-7)^ 



>_8y/2(i_^)2 



7 2/7(1-7) 
which is obtained from (|7.27|) . With H7.28|l we arrive at. 



(1-7)^ 



7 2/7(1 - 7)/ dj 
Combining these bounds we have that 



y.y> -6- 87 for 7 < 1/30. 



(7.29) 



d_ 

97 



i7-i + ^log(l/7)-13-147 



for 7 < 1/30, the term in brackets exceeds 1/2. (|7.25|l follows. 

7.12. Proof of Lemma 14.61 We will show that over the admissible domain. 



□ 



(7.30) 



d_ 

dv 



V ^ net ^ face/ ^ 



■log 



0(1) 



As T > 2, this proves the Lemma. For sufficiently small S, we have the inequality 



d_ 
dv 



(\r)+ _ ) 

\ net faeey 



d_ 

dv 



(*net ) + 7 log + 7 log 7 - 7 log(l - vi) 



(7.31) 

say. We will show that 

M[m,rU5))< 



< log + 7 log + i log 7 + i log(2e) + 87 + ^7 log 7 



■log 



<S < So; 



this implies H7.30|l . Wc first note that 



d_ 

d'-f 



17 1, 1-1 
logi^ + Y + 2log7+ 27 



(7.32) 



> log (5 

'T 



17 1 



log(7) 



1 



> 



- 1 



1 



log(l/<5)--loglog(l/5) 



y-2log(r) 



which for any r > 2 becomes arbitrarily large as d approaches zero. As a result, 
n{i>,"f) obtains its maximum where 7 is largest within the admissible domain, i.e. 

'wi 



at 7 = r^{d). To find the overall maximum we now examine the f direction along 
7 = r-+(<5): 



(7.33) 



1 



1 



r log 1/5 1 + 2x2 



/,From Lemma r4.2l it follows that for any r > 2, for 8 sufficiently small, 
Tlog(l/(5) > 21og(l/(5) > 2x^ + 1 - (5 > 2a;^ + 1 - 
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for v ^ [S,l). As a result H7.33|l is negative for S sufficiently small, indicating 
that the maximum of 7) over the domain of interest is obtained at {S, r^{S)). 
Moreover, 



1 



2e 



1 



nil., 7) < nis, p)<- log \-j -- + 

giving (|7.3U|) . 

7.13. Proof of Lemma l4?fl ^From (fT^ and 



log logz+ 
log 2:+ 







d 

^*/ace('''7) = -V [l0g7 + \0gV~ log(l - 1/7)] 



we have the lower bound, 



d 



-7-1 + log + - log(l/7) - 13 - 147 



--1 log(l/5) + -loglog(l/<5)-13 



□ 



^(*„et-*7ace)('^:7) > 

> 5 



+ -l0gT-14[Tl0g(l/5)]-l 

with the last inequality due to 7 < r'^[5). For any r > 2 the above bound is 
positive for 5 sufficiently small. 

□ 
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